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(57) Abstract 

The invention concerns a nucleic material, in isolated or purified state, and a nucleotide fragment comprising a nucleotide sequence 
selected from the group consisting in (i) the sequences SEQ ID NO: 112, SEQ ID NO: 114, SEQ ID NO: 117, SEQ ID NO: 120, SEQ 
ID NO. 124, SEQ ID NO: 130, SEQ ID NO: 141 and SEQ ID NO: 142, (ii) the complementary sequences of sequences (i); and (iii) the 
sequences equivalent to sequences (ii) and (iii), in particular the sequence having for every series of 100 contiguous monomers, at least 
50 %, preferably 70 % homology with sequences (i) and (ii) respectively. The invention also concerns their uses for detecting a retrovirus 
associated with multiple sclerosis and/or rheumatoid arthritis. 



(57) Abrege* 

Matdriel nucl6ique, a T6tat isol6 ou purifie\ et fragment nucldotidique, comprenant une sequence nucldotidique choisie dans le groupe 
qui consiste en (i) les sequences SEQ ID NO: 112, SEQ ID NO:l 14, SEQ ID NO: 117, SEQ ID NO: 120, SEQ ID NO: 124, SEQ ID NO: 
130, SEQ ID NO: 141 et SEQ ID NO: 142, (ii) les sequences compl&nentaires des sequences (i); et (iii) les sequences equivalentes aux 
sequences (i) ou (ii), en particulier les sequences pr£sentant pour toute suite de 100 monomeres contigus, au moins 50 %, et preTerentiellement 
au moins 70 % d'homologie avec respectivement les sequences (i) ou (ii), et utilisations pour d€tecter un retrovirus associ6 a la sclerose en 
plaques et/ou la polyarthrite rhumatoide. 
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C.(suUe) DOCUMENTS CONSIDERES COMME PERTINENTS 
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DATABASE EMBLHTG 


1-18,22 




SEQ ID NO: HSAC64 






Clone humain BAC R6083M05 de 7q21-7q22, 






14 fevrier 1997 






PAULEY A: "The sequence of H. sapiens BAC 






clone RG083M05" 






XP002090413 






Comparez des nucleotides 31070-32453 de 
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ID NO: 130, nucleotides 35881-37340 de 
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Nfi-117 nnrlanf iHac "?7T?1 — "i7S7fi 

inu.ii/, nucieouioes j/oji o/o/o 






de RG083M05 avec nucleotides 70-620 de SEQ 
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human alpha/delta T-cell receptor locus 






with bacterial artificial chromosome 






c i ones 






XP002090414 






Comparez nucleotides 7150-7510 de HSU85196 






aved nucleotides 1-361 de SEQ ID N0:141 et . 
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1-3,5, 




7 fevrier 1997 


13-20, 




cite dans la demande 
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voir le document en entier 
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WO 95 21256 A (BIO MERIEUX S.A.) 


1-3,5, 




10 aout 1995 


13-20, 




cite dans la demande 


22-26 




voir le document en entier 
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no. des revendications visees 
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6. LA MANTIA ET AL.: "Identification and 

characterization of novel human endogenous 

retroviral sequences preferentially 

expressed in undifferentiated embryonal 

carcinoma cells" 

NUCLEIC ACIDS RESEARCH, 

vol. 19, no. 7, 1991, pages 1513-1520, 

XP002059255 

OXFORD GB 

voir le document en entier 

H. PERRON ET AL.: "Molecular 
identification of a novel retrovirus 
repeatedly isolated from patients with 
multiple sclerosis" 

PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCES OF USA , 

vol. 94, no. 14, 8 juillet 1997, pages 

7583-7588, XP002059256 

WASHINGTON US 

voir le document en entier 

WO 94 28138 A (UNIVERSITY COLLEGE LONDON) 
8 decembre 1994 

voir le document en entier 

WO 93 07259 A ( SCLEROSE-FORENINGEN) 
15 avril 1993 

voir le document en entier 



1-3,5, 
13-17, 
22-24 



1-3,5, 
13-20, 
22-26 



1-3,5, 
13-20, 
22-26 



1-3,5, 
13-20, 
22-26 
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Cadre I Observations - lorsqu'ii a ete estime que certaines revendications ne pouvaient pas faire I'objet d'une recherche 
(suite du point 1 de la premiere feuille) 



Conformement a ('article I7.2)a), certaine3 revendications n'ont pas fait I'objet d'une recherche pour les motifs suivants: 



Les revendications n 03 

se rapportent a un objet a I'egard duquel ('administration n'est pas tenue de proceder a la recherche, a savoir: 



2. Les revendications n 03 

se rapportent a des parties de la demande Internationale qui ne remplissent pas suffisamment les conditions prescrites pour 
qu'une recherche significative puisse etre effectuee, en particulier: 



3. \_ Les revendications n ^ 

sont des revendications dependantes et ne sont pas redigees conformement aux dispositions de la deuxieme et de la 
troisteme phrases de la regie 6.4. a). 



Cadre li Observations * lorsqu'ii y a absence d* unite de I'invention (suite du point 2 de la premiere feuille) 



L'administration char gee de la recherche Internationale a trouve plusieurs inventions dans la demande international, a savoir: 



voir feuille supplementaire 



1 . I y | Comme toutes les taxes additionnelles ont ete payees dans les delais par le deposant, le present rapport de recherche 
*-*»-• intern ationale porte sur toutes les revendications pouvant faire I'objet d'une recherche. 

Comme toutes les recherches portant sur les revendications qui s'y pretaient ont pu etre effectuees sans effort particulier 
justifiant une taxe additionnelle, l'administration n'a sollicite le paiement d'aucune taxe de cette nature. 



3. I I Comme une partie seulement des taxes additionnelles demandees a ete payee dans les delais par le deposant, le present 

' ' rapport de recherche international ne porte que 3ur les revendications pour lesquelles les taxes ont ete payees, a savoir 

les revendications n 00 



4. I Aucune taxe additionnelle demandee n'a ete payee dans les delais par le deposant. En consequence, le present rapport 

' ' de recherche in tern ationale ne porte que sur I'invention mentionnee en premier lieu dan3 les revendications; elle est 

couverte par les revendications n °* 



Remarque quant a la reserve 



| X | Les taxes additionnelles etaient aocompagnees d'une reserve de la part du deposan 
[ | Le paiement des taxes additionnelles n'etait assorti d'aucune reserve. 
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SUITE DES RENSEIGNEMENTS INDIQUES SUR PCT/ISA/ 210 



L'admini stration chargee de la recherche Internationale a trouve 
plusieurs (groupes d') inventions dans la demande Internationale, a 
savoir: 

1. revendi cations: 5 (completement) ,1-3, 13-20, 22-26 (partiel lement) 

Materiel nucleique comprenant une sequence SEQ ID NO: 112, ou 
codant pour une peptide avec la sequence SEQ ID NO: 113, 
peptides correspondants, compositions comprenant un fragment 
desdites sequences et leur utilisation dans un procede pour 
detecter un retrovirus associe a la sclerose en plaques. 



2. revendications: 10 (completement) 1, 2, 7-9, 11-26 (partiel lement) 

Corrnie pour l'invention 1 mais se rapportant a SEQ ID N0:114, 
SEQ ID N0:115, SEQ ID N0:117, SEQ ID N0:118, SEQ ID N0:120 
et SEQ: ID: N0:121. 



3. revendications: 1, 3, 13-15, 18, 19, 21-26 (toutes partiel lement) 
Comme pour l'invention 1 mais se rapportant a SEQ ID N0:124. 



4. revendications: 1, 2, 4, 6, 13-19, 21-26 (toutes partiel lement) 

Conrme pour l'invention 1 mais se rapportant a SEQ ID N0:130, 
SEQ ID N0:131, SEQ ID N0:135 et SEQ ID N0:137. 



5. revendications: 1, 11, 13-15, 18-20, 22-26 (toutes partiellement) 

Connie pour l'invention 1 mais se rapportant a SEQ ID N0:141 
et SEQ ID N0:142. 
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International Filing Date 


The undersigned requests that the present 

international application be processed 
according to the Patent Cooperation Treaty. 


Name of receiving Office and "PCT International Application" 




Applicant's or agent's file reference 

(if desired) (J 2 characters maximum) MD/B05B2867 


Box No. I TITLE OF INVENTION 

RETROVIRAL NUCLEIC MATERIAL AND NUCLEOTIDE FRAGMENTS, IN PARTICULAR ASSOCIATED WITH 
MULTIPLE SCLEROSIS AND/OR RHEUMATOID ARTHRITIS, FOR DIAGNOSTIC, PROPHYLACTIC AND 
THERAPEUTIC USES 


Box No. II APPLICANT 


Name and address: (Family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country. The country of the address 
indicated in this Box is the applicants State (that is, country) of residence if no State of residence is 
indicated below.) 

BIO MERIEUX 
Chemin de I'Orme 
69280 MARCY L ETOILE 
FRANCE 


| | This person is also inventor. 

Telephone No. 
Facsimile No. 
Teleprinter No. 


State (that is, country) of nationality: 

FRANCE 


State (that is, country) of residence: 

FRANCE 


This person is applicant | 1 all designated all designated States except the i i the United States i 1 the States indicated in 

for the purposes of: 1 1 States l/W United States of America I I of America only I I the Supplemental Box 


Box No. Ill FURTHER APPLICANT(S) AND/OR (FURTHER) INVENTOR(S) 


Name and address: (Family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country. The country of the address 
indicated in this Box is the applicants State (that is, country) of residence if no State of residence is 
indicated below.) 

PARAHNOS-BACCALA Glaucia 
75 Cours Duguesclin 
69003 LYON 
FRANCE 


This person is: 

| | applicant only 

\X\ applicant and inventor 

| | inventor only (If this check-box 
is marked, do not fill in below.) 


State (that is, country) of nationality: 

FRANCE 


State (that is, country) of residence: 

FRANCE 


This person is applicant j 1 all designated i i all designated States except the United States i 1 the States indicated in 

for the purposes of: 1 1 States 1 I the United States of America l^Nl of America only I I the Supplemental Box 


P><] Further applicants and/or (further) inventors are indicated on a continuation sheet. 


Box No. IV AGENT OR COMMON REPRESENTATIVE; OR ADDRESS FOR CORRESPONDENCE 


The person identified below is hereby/has been appointed to act on behalf 

of the applicant(s) before the competent International Authorities as: agent | | common representative 


Name and address: (Family name followed by given name; for a legal entity, full official 

designation. The address must include postal code and name of country.) 

CABINET GERMAIN & MAUREAU 
B.P. 6153 

69466 LYON CEDEX 06 
FRANCE 


Telephone No. 

04 72 69 84 30 

Facsimile No. 

04 72 69 84 3 1 

Teleprinter No. 


| | Address for correspondence: Mark this check-box where no agent or common representative is/has been appointed and the 
space above is used instead to indicate a special address to which correspondence should be sent 



Form PCT/RO/101 (first sheet) (July 1998) . See Notes to the request form 



Sheet No. 2 



Continuation of Box No. Ill FURTHER APPLICANT(S) AND/OR (FURTHER) INVENTOR(S) 


If none of the following sub-boxes is used, this sheet should not be included in the request 


Name and address: (Family name followed by given name; for a legal entity, full official 

designation. The address must include postal code and name of country. The country of the 

address indicated in this Box is the applicant's State (that is, country) of residence if no State 

of residence is indicated below.) 

KOMURJAN-PRADEL Florence 

1 14 Chemin du Pavilion 

69250 POLEYMIEUX AU MONT D'OR 

FRANCE 


This person is: 

| | applicant only 

E><3 applicant and inventor 

| | inventor only (If this check-box 
is marked, do not Jill in below.) 


State (that is, country) of nationality: 
FRANCE 


State (that is, country) of residence: 
FRANCE 


This person is applicant , . all designated . . all designated States except the United States of . . the States indicated in 

for the purposes of: 1 1 States | | the United States of America America only | | the Supplemental Box 


Name and address: (Family name followed by given name; for a legal entity, full official 

designation. The address must include postal code and name of country. The country of the 

address indicated in this Box is the applicant's State (that is, country) of residence if no State 

of residence is indicated below.) 

BEDIN Frederic 

6 Rue Gaspard Andre 

69002 LYON 

FRANCE 


This person is: 

| [ applicant only 

[X] applicant and inventor 

| | inventor only (If this check-box 
is marked, do not fill in below.) 


State (that is, country) of nationality: FRANCE 


State (that is, country) of residence: FRANCE 


This person is applicant ■ t all designated . . all designated States except . the United States of . > the States indicated in 

for the purposes of: 1 1 States | | the United States of America |/\| America only | | the Supplemental Box 


Name and address: (Family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country. The country of the 
address indicated in this Box is the applicants State (that is, country) of residence if no State 
of residence is indicated below.) 
SODOYER Mireille 
5 rue du Brulet 

69 110 SAINTE FOY LES LYON 
FRANCE 


This person is: 

| | applicant only 

applicant and inventor 

| [ inventor only (If this check-box 
is marked, do not fill in below.) 


State (that is, country) of nationality: FRANCE 


State (that is, country) of residence: FRANCE 


This person is applicant , . all designated . — . all designated States except the United States | — . the States indicated in 

for the purposes of: I J States | | the United States of America |/\| of America only | | the Supplemental Box 


Name and address: (Family name followed by given name; for a legal entity, full official 

designation. The address must include postal code and name of country. The country of the 

address indicated in this Box is the applicant's State (that is, country) of residence if no State 

of residence is indicated below.) 

OTT Catherine 

103 Avenue Berthelot 

69007 LYON 

FRANCE 


This person is: 

| | applicant only 

^ applicant and inventor 

| [ inventor only (If this check-box 
is marked, do not fill in below.) 


State (that is, country) of nationality: FRANCE 


State (that is, country) of residence: FRANCE 


This person is applicant . i all designated . — . all designated States except the United States | — . the States indicated in 

for the purposes of: 1 1 States | | the United States of America |/\| of America only [ | the Supplemental Box 


Further applicants and/or (further) inventors are indicated on another continuation sheet. 



Sheet No. 3 



Continuation of Box No. Ill FURTHER APPLICANT(S) AND/OR (FURTHER) INVENTOR(S) 


If none of the following sub-boxes is used, this sheet should not be included in the request. 


Name and address: (Family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country. The country of the 
address indicated in this Box is the applicant's State (that is, country) of residence if no State 
of residence is indicated below.) 
MALLET Francois 
84 rue Anatole France 
69100 V1LLEURBANNE 
FRANCE 


This person is: 

1 I applicant only 

[X] applicant and inventor 

| | inventor only (If this check-box 
is marked, do not fill in below.) 


State (that is, country) of nationality: 
FRANCE 


State (that is, country) of residence: 
FRANCE 


This person is applicant | , all designated . — . all designated States except the United States of ( . the States indicated in 

for the purposes of: 1 1 States I I tne United States of America L/\] America only | | the Supplemental Box 


Name and address: (Family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country. The country of the 
address indicated in this Box is the applicant's State (that is, country) of residence if no State 
of residence is indicated below.) 
PERRON Herve 
15 rue de Boyer 
69005 LYON 
FRANCE 


This person is: 

| [ applicant only 

\)^\ applicant and inventor 

| | inventor only (If this check-box 
is marked, do not fill in below.) 


State (that is, country) of nationality: FRANCE 


State (that is, country) of residence: FRANCE 


This person is applicant | . all designated . . all designated States except the United States of . . the States indicated in 

for the purposes of: 1 1 States | | the United States of America p\] America only | | the Supplemental Box 


Name and address: (Family name followed by given name; for a legal entity, full official 

designation. The address must include postal code and name of country. The country of the 

address indicated in this Box is the applicant's State (that is, country) of residence if no State 

of residence is indicated below.) 

MANDRAND Bernard 

21 rue de la Doua 

69100 VILLEURBANNE 

FRANCE 


This person is: 

| | applicant only 

applicant and inventor 

| [ inventor only (If this check-box 
is marked, do not fill in below.) 


State (that is, country) of nationality: FRANCE 


State (that is, country) of residence: FRANCE 


This person is applicant i i all designated . . all designated States except . the United States . . the States indicated in 

for the purposes of: 1 1 States | | the United States of America [/\| of America only | | the Supplemental Box 


Name and address: (Family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country. The country of the 
address indicated in this Box is the applicant's State (that is, country) of residence if no State 
of residence is indicated below.) 


This person is: 

| | applicant only 

| | applicant and inventor 

| | inventor only (If this check-box 
is marked, do not fill in below.) 


State (that is, country) of nationality: 


State (that is, country) of residence: 


This person is applicant | . all designated | ( all designated States except . > the United States . . the States indicated in 

for the purposes of: 1 1 States | | the United States of America [ j of America only [ | the Supplemental Box 


| | Further applicants and/or (further) inventors are indicated on another continuation sheet. 



Sheet No. 4 



Box No. V DESIGNATION OF STATES 



The following designations are hereby made under Rule 4.9(a) (mark the applicable check-boxes; at least one must be marked): 



Regional Patent 
AP 



National 

B 

El 
£3 

El 

El 
EI 
El 
El 
El 
El 
El 
El 
El 
El 
El 
El 
El 
El 
El 

El 
El 
El 
El 
El 
El 
El 
El 
El 

El 
El 
El 
El 
El 



EA 



EP 



OA 



ARIPO Patent: GH Ghana, GM Gambia, KE Kenya, LS Lesotho, MW Malawi, SD Sudan, SZ Swaziland, 
UG Uganda, ZW Zimbabwe, and any other State which is a Contracting State of the Harare Protocol and of the PCT 
Eurasian Patent: AM Armenia, AZ Azerbaijan, BY Belarus, KG Kyrgyzstan, KZ Kazakhstan, MD Republic of 
Moldova, RU Russian Federation, TJ Tajikistan, TM Turkmenistan, and any other State which is a Contracting State 
of the Eurasian Patent Convention and of the PCT 

European Patent: AT Austria, BE Belgium, CH and LI Switzerland and Liechtenstein, CY Cyprus, DE Germany, 
DK Denmark, ES Spain, FI Finland, FR France, GB United Kingdom, GR Greece, IE Ireland, IT Italy, 
LU Luxembourg, MC Monaco, NL Netherlands, PT Portugal, SE Sweden, and any other State which is a Contracting 
State of the European Patent Convention and of the PCT 

OAPI Patent: BF Burkina Faso, BJ Benin, CF Central African Republic, CG Congo, CI Cote d'lvoire, 
CM Cameroon, GA Gabon, GN Guinea, ML Mali, MR Mauritania, NE Niger, SN Senegal, TD Chad, TG Togo, and 
any other State which is a member State of OAPI and a Contracting State of the PCT (if other kind of protection or 
treatment desired, specify on dotted line) 



Patent (if other kind of protection or treatment desired, 

AL Albania 

AM Armenia 

AT Austria 

AU Australia 

AZ Azerbaijan 

BA Bosnia and Herzegovina 

BB Barbados 

BG Bulgaria 

BR Brazil 

BY Belarus 

CA Canada 

CH and LI Switzerland and Liechtenstein 

CN China 

CU Cuba 

CZ Czech Republic .- 

DE Germany 

DK Denmark 

EE Estonia 

ES Spain 

FI Finland 

GB United Kingdom 

GE Georgia 

GH Ghana 

GM Gambia... 

HR Croatia 

HU Hungary 

ID Indonesia 

IL Israel 

IS Iceland 

JP Japan 

KE Kenya 

KG Kyrgyzstan 

KP Democratic People's Republic of Korea 



specify on dotted line): 

^ LS Lesotho 

LT Lithuania 

LU Luxembourg 
^ LV Latvia 

MD Republic of Moldova 

Efl MG Madagascar 

[>3 MK The former Yugoslav Republic of Macedonia 



£3 MN 
MW 

g| MX 

[3 NO 

^ NZ 

E3 PL 

El pt 

[3 RO 

gj RU 

El SD 

El SE 

El sg 

El si 

El sk 

12 SL 

EI TJ 

E] TM 

El TR 

E| ua 

H UG 

El us 



Mongolia 

Malawi 

Mexico 

Norway 

New Zealand 

Poland 

Portugal 

Romania 

Russian Federation 

Sudan 

Sweden 

Singapore 

Slovenia 

Slovakia 

Sierra Leone 

Tajikistan 

Turkmenistan 

Turkey 

Trinidad and Tobago 

Ukraine 

Uganda 

United States of America.. 



KR Republic of Korea.. 

KZ Kazakhstan 

LC Saint Lucia 

LK Sri Lanka 

LR Liberia 



^ UZ Uzbekistan 

^ VN Viet Nam 

^ YU Yugoslavia 

^ ZW Zimbabwe 

Check-boxes reserved for designating States (for the purposes of 
a national patent) which have become party to the PCT after 
issuance of this sheet: 

□ 

□ 



Precautionary Designation Statement: In addition to the designations made above, the applicant also makes under Rule 4.9(b) all 
other designations which would be permitted under the PCT except the designation(s) indicated in the Supplemental Box as being 
excluded from the scope of this statement. The applicant declares that those additional designations are subject to confirmation and 
that any designation which is not confirmed before the expiration of 1 5 months from the priority date is to be regarded as withdrawn 
by the applicant at the expiration of that time limit. (Confirmation of a designation consists of the filing of a notice specifying that designation 
and the payment of the designation and confirmation fees. Confirmation must reach the receiving Office within the 1 5 -month time limit.) 
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Box No. VI PRIORITY CLAIM 



| | Further priority claims are indicated in the Supplemental Box. 



Filing date 
of earlier application 
{day/month/year) 



Number 
of earlier application 



Where earlier application is: 



national application: 
country 



regional application: 
regional Office 



international application: 
receiving Office 



item(l) 

7 July 1997 



97 08816 



FRANCE 



item (2) 



item (3) 



The receiving Office is requested to prepare and transmit to the International Bureau a certified copy of the earlier 
application(s) (only if the earlier application was filed with the Office which for the purposes of the present international 
application is the receiving Office) identified above as item(s): {jJ 

* Where the earlier application is an ARIPO application, it is mandatory to indicate in the Supplemental Box at least one country party to the 
Paris Convention for the Protection of Industrial Property for which that earlier application was filed (Rule 4. 10(b)(ii)). See Supplemental Box. 



Box No. VII INTERNATIONAL SEARCHING AUTHORITY 



Choice of International Searching Authority (ISA) 

(if two or more International Searching Authorities are 
competent to carry out the international search, indicate 
the Authority chosen; the two-letter code may be used): 

ISA/ 



Request to use results of earlier search: reference to that search (if an earlier 
search has been carried out by or requested from the International Searching 
Authority): 



Date (day/month/year) 



Number 



Country (or regional Office) 



Box No. VIII CHECK LIST; LANGUAGE OF FILING 



This international application contains 
the following number of sheets: 

request :5 
description (excluding 
sequence listing part) :38 

claims :5 

abstract : 1 

drawings :32 
sequence listing part 

of description :20 



Total number of sheets : 1 0 1 



This international application is accompanied by the item(s) marked below: 

1 . Q fee calculation sheet 

2. Q separate signed power of attorney 

3. [>3 copy of general power of attorney; reference number, if any: 

4. □ statement explaining lack of signature 

5. □ priority document(s) identified in Box No. VI as item(s): 

6. □ translation of international application into (language): 

7. □ separate indications concerning deposited microorganism or other biological material 

8. □ nucleotide and/or amino acid sequence listing in computer readable form 

9. □ other (specify): 



Figure of the drawings which 
should accompany the abstract: 


Language of filing of the 

international application: FRENCH 


Box No. IX SIGNATURE OF APPLICANT OR AGENT 



Next to each signature, indicate the name of the person signing and the capacity in which the person signs (if such capacity is not obvious from 
reading the request). 
CABINET GERMAIN & MAUREAU 



Dominique GUERRE 

CPI 32 1 1 04 Lyon, 7 July 1 998 



For receiving Office use only 



1. 


Date of actual receipt of the purported 
international application: 




2. 


Drawings: 


3. 


Corrected date of actual receipt due to later but 
timely received papers or drawings completing 
the purported international application: 




□ 


received: 


4. 


Date of timely receipt of the required corrections 
under PCT Article 11(2): 




□ 


not received: 


5. 


International Searching Authority 

(if two or more are competent): ISA / 


g 1 1 Transmittal of search copy delayed 
1 — ' until search fee is paid 







For International Bureau use only 



Date of receipt of the record copy 
by the International Bureau: 



TRAITE C OPERATION EN MATIERE Dt. JF 'ETS 



PCT 



RAPPORT DE RECHERCHE INTERNATIONALE 
(article 18 et regies 43 et 44 du PCT) 



Reference du dossier du deposant ou 
du man data ire 

MD/B05B2867 



POUR SUITE vo ' r ' a notification de transmission du rapport de recherche intern at ion ale 
(formulai re PCT/IS A/220) et, le cas echeant, le point 5 ci-apres 

A DONNER 



Demande Internationale n° 

PCT/ FR 98/ 01460 



Date du depot international (jour/mois/annee) 

07/07/1998 



(Date de priorite (la plus ancienne) 
(jour/mois/annee) 

07/07/1997 



Deposant 



BIO MERIEUX et al . 



Le present rapport de recherche internationale, etabli par ('administration chargee de la recherche internationale, est transmis au 
deposant conformement a I'article 1 8. Une copie en est transmise au Bureau international. 



Ce rapport de recherche internationale comprend . 



feuilles. 



fy"[ |t est aussi accompagne d'une copie de chaque document relatif a I'etat de la technique qui y est cite. 



1 Q II a ete estime que certain es re vend teat ions ne pouvaient pas fair© I'objet d'une recherche (voir le cadre I). 

2. | )( | || y a absence d'unite de I'invention (voir le cadre II). 

3 [v"! La demande internationale contient la divulgation d'un listage de sequence de nucleotides ou d'acides amines et la 

***** recherche internationale a ete effectuee sur la base du listage de sequence 

| y | depose avec la demande internationale 

[ | foumi par le deposant separement de la demande internationale 

I I sans etre accompagnee d'une declaration selon laquelle it n'inclut pas d'elements 
— allant au-dela de la divulgation faite dans la demande internationale telle 
qu'elle a ete deposee. 

| | transcrit par ('administration 



4. En ce qui conceme le titre, [^] le texte est approuve tel qu'il a ete remise par le deposant. 

| [ Le texte a ete etabli par I'administration et a la teneur suivante: 



5. En ce qui conceme I'abrege, 



le texte est approuv6 tel qu'il a ete remts par le deposant 

I I le texte (reproduit dans le cadre III) a ete etabli par Tad mini strati on conformement a la 
— regie 38.2b). Le deposant peut presenter des observations a I' administration dans un delai 
d'un mois a compter de la date d'expedition du present rapport de recherche internationale. 



6. La figure des dessins a pubfier avec I'abrege est la suivante: 

Figure n° suggeree par le deposant. 

| | parce que le deposant n'a pas suggere de figure. 
| | parce que cette figure caracterise mieux Tinvention. 



SI 



Aucune des figures 
n'est a publier. 



9 PCT/ISA/210 fcremiere feuille) Quillet 1992) 



RAPPORT DE RECHERCHE INTERNATIONALE 



vnande intemationale n° 

PCT/FR 98/01460 



Cadre I Observations - lorsqu'il a ete est i me que certaines revendi cations ne pouvaient pas laire I'objet d une recherche 
(suite du point 1 de la premiere feuille) 

Conformement a ('article 17.2)a), certaines revendications n'ont pas fait I'objet d'une recherche pour les motifs suivants: 

1 [ Les revendications n 05 

se rapportent a un objet a i'egard duque! ('administration n'est pas tenue de proceder a la recherche, a savoir: 



2 Les revendications n 05 _^ 

se rapportent a des parties de la demande intemationale qui ne remplissent pas suffisamment les conditions prescntes pour 

qu'une recherche significative puisse etre effectuee, en particulier: 



3. Les revendications n os _ 

sont des revendications dependantes et ne sont pas redigees conformement aux dispositions de la deuxieme et de la 

troisieme phrases de la regie 6.4.a). 

Cadre II Observations - lorsqu'il y a absence d' unite de ('invention (suite du point 2 de la premiere feuille) 

L'administration chargee de la recherche intemationale a trouve plusieurs inventions dans la demande intemationale, a savoir: 

voir feuille supplemental re 



□ 



Comme toutes les taxes additionnelles ont ete payees dans les delais par le deposant, le present rapport de recherche 
intemationale porte sur toutes les revendications pouvant faire I'objet cfune recherche. 

Comme toutes les recherches portant sur les revendications qui s'y pretaient ont pu etre effectuees sans effort particulier 
justifiant une taxe additionnelle, I'administration n'a sollicite le paiement d'aucune taxe de cette nature. 



□ Comme une partie seulement des taxes additionnelles demandees a ete payee dans les delais par le deposant, le present 
rapport de recherche intemationale ne porte que sur les revendications pour lesquelles les taxes ont ete payees, a savoir 
les revendications n 08 



4. I I Aucune taxe additionnelle demandee n'a ete payee dans les.delais par le deposant. En consequence, le present rapport 

' 1 de recherche intemationale ne porte que sur I' invention mentionnee en premier lieu dans les revendications; elle est 

couverte par les revendications n 08 



Remarque quant a ta reserve | X | Les taxes additionnelles etaient accompagnees d'une reserve de la part du deposan 

j j Le paiement des taxes additionnelles n'etait assorti d'aucune reserve. 



For mu laire PCT/1SA/21 0 (suite de la premiere feuille (1)) (Juiilet 1992) 



Demande Internationale No. PCT/FR 98/01460 



SUITE DES RENSEIGNEMENTS INDIQUES SUR PCT/1SA/ 210 

L' administration chargee de la recherche internationale a trouve 
plusieurs (groupes d') inventions dans la demande internationale, a 
savoir: 

1. revendications: 5 (completement) ,1-3, 13-20, 22-26 (partiel lement) 

Materiel nucleique comprenant une sequence SEQ ID N0:112, ou 
codant pour une peptide avec la sequence SEQ ID NO: 113, 
peptides correspondants, compositions comprenant un fragment 
desdites sequences et leur utilisation dans un procede pour 
detecter un retrovirus associe a la sclerose en plaques. 



2. revendications: 10 (completement) 1, 2, 7-9, 11-26 (partiel lement) 

Comme pour l'invention 1 mais se rapportant a SEQ ID N0:114, 

SEQ ID N0:115, SEQ ID N0:117, SEQ ID N0:118, SEQ ID NO:120 
et SEQ: ID: N0:121. 



3. revendications: 1, 3, 13-15, 18, 19, 21-26 (toutes partiel lement) 
Comne pour V invention 1 mais se rapportant a SEQ ID NO: 124. 



4. revendications: 1, 2, 4, 6, 13-19, 21-26 (toutes partiel lement) 

Connie pour l'invention 1 mais se rapportant a SEQ ID NO:130, 
SEQ ID N0:131, SEQ ID N0:135 et SEQ ID N0:137. 

5. revendications: 1, 11, 13-15, 18-20, 22-26 (toutes partiel lement) 

Conine pour l'invention 1 mais se rapportant a SEQ ID N0:141 
et SEQ ID N0:142. 



RAPPORT DE CHERCHE INTERNATIONALE 



Of -*ande Internationale No 

r, ,/FR 98/01460 



A CLASSEMENT DE L'OBJET DE LA DEMANDE, 

CIB 6 C12N15/48 C12Q1/70 C07K14/15 A61K31/70 



Selon la classification international des brevets (CIB) ou a la tois selon la classification nationale el la CIB 



B. DOMAINES SUR LESQUELS LA RECHERCHE A PORTE 



Documentation minimale consultee (systeme de classification suivi des symboles de classement) 

CIB 6 C12N C07K C12Q A61K 



Documentation consultee autre que la documentation minimale dans la mesure ou ces documents relevent des domaines sur lesquels a porte la recherche 



Base de donnees electronique consultee au cours de la recherche Internationale (nom de la base de donnees. et si realisable, termes de recherche utilises) 



C. DOCUMENTS CONSIDERES COMME PERTINENTS 



Categorie ° Identification des documents cites, avec, le cas echeant, I'indication des passages pertinents 



no. des revendtcations visees 



p,x 



WO 98 23755 A (BIO MERIEUX) 4 juin 1998 
voir le document en entier 

-/- 



1-26 



suite du cadre C pour la fin de la liste des documents 



ID 



Les documents de families de brevets sont indiques en annexe 



0 Categories speciales de documents cites: 

"A" document definissant I'etat general de la technique, non 

considere comme particulierement pertinent 
"E" document anteneur, mais publie a la date de dep6t international 

ou apres cette date 
"L" document pouvant jeter un doute sur une revendication de 

priorite ou cite pour determiner la date de publication d'une 

autre citation ou pour une raison speciale (telle qu'indiquee) 
"O" document se referant a une divulgation orale, a un usage, a 

une exposition ou tous autres moyens 
"P" document publie avant la date de dep6t international, mais 

posterieurement a la date de priorite revendiquee 



"T" document ulterieur publie apres la date de depot international ou la 
date de priorite et n'appartenenant pas a I'etat de la 
technique pertinent, mais crte pour comprendre le principe 
ou la theorie constituant la base de I'invention 

"X" document particulierement pertinent; I'inven tton revendiquee ne peut 
etre consideree comme nouvelle ou comme impliquant une activite 
inventive par rapport au document considere isolement 

"Y" document particulierement pertinent; I'inven tion revendiquee 

ne peut etre consideree comme impliquant une activite inventive 
lorsque le document est associe a un ou plusieurs autres 
documents de meme nature, cette combinaison etanl evidente 
pour une personne du metier 

'&" document qui fait partie de la meme lamille de brevets 



Date a laquelle la recherche internationaie a ete effectivement achevee 



29 Janvier 1999 



Date d'expedition du present rapport de recherche internationaie 

1 7. 01 99 



Nom et adresse postale de I'administration chargee de la recherche Internationale 
Office Europeen des Brevets. P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040. Tx. 31 651 epo nl. 
Fax: (+31-70) 340-3016 



Fonctionnaire autorise 



Cupido, M 



RAPPORT DE -CHERCHE INTERNATIONALE 



P~-*iande Internationale No 

. -f/FR 98/01460 
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1 . Le present rapport d'examen preliminaire international, etabli par I'administaration chargee de I'examen preliminaire 
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2. Ce RAPPORT comprend 1 1 feuilles, y compris la presente feuille de couverture. 

H II est accompagne d'ANNEXES, c'est-a-dire de feuilles de la description, des revendications ou des dessins qui ont 
ete modifiees et qui servent de base au present rapport ou de feuilles contenant des rectifications faites aupres de 
I'administration chargee de I'examen preliminaire inte/iational (voir la regie 70.16 et ('instruction 607 des Instructions 
administratives du PCT). 
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t. Base du rapport 

1 . Ce rapport a ete redige sur la base des elements ci-apres (les feuilles de remplacement qui ont ete remises a 
I'office recepteur en reponse a une invitation faite conformement a i'articie 14 sont considerees, dans le present 
rapport, comme "initialement deposees" et ne sont pas jointes en annexe au rapport puisqu'elles ne contienneni 
pas de modifications.) : 

Description, pages: 

1-38 version initiaie 



Revendications, N°: 

^28 resue(s) le 18/10/1999 avec la lettre du 13/10/1999 

Dessins, feuilles: 

1/32-32/32 version initiaie 



2. Les modifications ont entraine I'annulation : 

□ de la description, pages : 

□ des revendications, n os : 

□ des dessins, feuilles : 

3 □ Le present rapport a ete formule abstraction faite (de certaines) des modifications, qui ont ete considerees 
comme allant au-dela de i'expose de I'invention tel qu'il a ete depose, comme il est indique ci-apres 
(regie 70.2(c)) : 



4. Observations complementaires, le cas echeant : 



II. Priorite 

1 . □ Le present rapport a ete formulee comme si aucune priorite n'avait ete revendiquee, du fait que 

documents suivants n'ont pas ete remis dans le delai present : 

□ copie de la demande anterieure dont la priorite a ete revendiquee. 

□ traduction de la demande anterieure dont la priorite a ete revendiquee. 

2. □ Le present rapport a ete formulee comme si aucune priorite n'avait ete revendiquee, du fait que 

Formulaire PCT/tPEA/409 (cadres I- VIII. feuille 1) Garwier 1994) 
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revendication de la priorite a ete jugee non valable. 

Pour les besoins du present rapport, la date de depot international indiquee plus haut est done 
consideree comme la date pertinente. 

3. Observations complementaires, le cas echeant : 
voir feuille separee 



IV. Absence d'unite de I'invention 

1 . En reponse a I'invitation a limiter les revendications ou a payer des taxes additionnelles, le deposant a 

limite les revendications. 

□ paye des taxes additionnelles. 

□ paye des taxes additionnelles sous reserve. 

□ ni limite les revendications ni paye des taxes additionnelles. 

2. □ L'administration chargee de I'examen preliminaire international estime qu'il n'est pas satisfait a I'exigence 

d'unite d'invention et decide, confomnement a la regie 68.1 , de ne pas inviter le deposant a limiter les 
revendications ou a payer des taxes additionnelles. 

3. L'administration chargee de I'examen preliminaire international estime que, aux termes des regies 13.1 ,13.2 et 
13.3, 

□ il est satisfait a I'exigence d'unite de I'invention. 

K il n'est pas satisfait a I'exigence d'unite de I'invention, et ce pour les raisons suivantes : 
voir feuille separee 

4. En consequence, les parties suivantes de la demande intemationale ont fait I'objet d'un examen preliminaire 
international lors de la formulation du present rapport : 

□ toutes les parties de la demande. 

El les parties relatives aux revendications n os 7-10 et 12 (en entier) et 1 , 2. 11 et 13-28 (partiellemnt). 
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V. Declaration motivee selon (article 35(2) quant a la nouveaute, lactivite inventive et la possibilite 
dapplication industrielle; citations et explications a I'appui de cette declaration 

1. Declaration 

Nouveaute Oui : Revendications 7-1 0 et 1 2 (en entier) et 1 , 2, 1 1 et 1 3-28 

(partiellement) 

Non : Revendications 

Activite inventive Oui: Revendications 

Non : Revendications 7-10 et 12 (en entier) et 1 , 2, 1 1 et 13-28 

(partiellement) 

Possibilite ^application industrielle Oui : Revendications 7-10 et 12 (en entier) et 1, 2, 1 1 et 13-28 

(partiellement) 

Non : Revendications 



2. Citations et explications 
voir feuille separee 



VI. Certain documents cites 

1 . Certains documents publies (regie 70.10) 
et / ou 

2. Divulgations non ecrites (regie 70.9) 
voir feuille separee 



VIII. Observations relatives a la demande intemationale 

Les observations suivantes sont faites au sujet de la clarte des revendications, de la description et 
et de la question de savoir si les revendications se fondent entierement sur la description : 

voir feuille separee 
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SECTION I 

1. Les revendications modifiees 1-28, deposees avec la lettre du 13.10.99, sont 
admissibles selon la Regie 70.2(c) PCT. 



SECTION II 



2. Le present Rapport d'Examen Preliminaire International a ete etabli en ayant pour 
hypothese que la date de priorite 07.07.97 de la presente demande est valablement 
revendiquee. Le document Proceedings of the National Academy of Sciences of 
USA, Vol. 94, No. 14, 8 Juillet 1997, pages 7583-7588, cite comme document "P" 
dans le rapport de recherche n'a done pas ete considere comme compris dans I'etat 
de la technique selon le reglement d'execution (Regie 64 (1)-(3) PCT). 



SECTION IV 

3. Absence d'Unite d'invention (Article 34(3) PCT et Regies 1 3 et 68 PCT) 

3.1. L'administration chargee de I'examen preliminaire international estime que la 
presente demande intemationale ne satisfait pas a I'exiaence d'unite d'invention 
(Article 34(3) PCT et Regies 13 et 68 PCT). 

3.2. Une demande de brevet ne peut concerner qu'une invention ou une pluralite 
d'inventions liees entre elles de telle sorte qu'elles ne forment qu'un seul concept 
inventif general. 

L'exigence d 'unite d'invention (Regie 13.1 PCT) n'est observee que s'il existe entre 
ces inventions une relation technique portant sur un ou plusieurs elements 
techniques particuliers identiques ou correspondants. L'expression elements 
techniques particuliers s'entend des elements techniques qui determined une 
contribution de chacune des inventions revendiquees, consideree comme un tout, 
par rapport a I'etat de la technique. 
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3.3. La relation technique reliant I'objet des revendications 1-28 est relative au fait 
qu'elles concernent toutes des materiaux nucleiques derives d'un extrait de plasma 
de patients atteints de la sclerose en plaques (SEP) et derives d'un retrovirus MSRV 
(Multiple Sclerosis (MS) associated Retrovirus (RV)). 

Cependant, cette relation ne peut pas etre acceptee comme etant un " element 
technique particulier ", tel que defini ci-dessus, des lors qu'elle ne determine pas une 
contribution de chacune des differentes inventions revendiquees, consideree comme 
un tout, par rapport a I'etat de la technique. 

De fait, des sequences derivees de MSRV et se rapportant a la sclerose en plaques 
sont connues dans I'art anterieur. 

Les documents D1 (FR-A-2 737 500s et D2 (WO-A-95 21256) decrivent du materiel 
viral, a I'etat isole ou purifie, et des fragments nucleotidiques, associes a la sclerose 
en plaques. Cela est reconnu par la Demanderesse, comme indique a la page 5 de 
la description de la demande. En outre, ces documents decrivent aussi I'utilisation 
de ces sequences a des fins de diagnostic, prophylactiques et therapeutiques. 

En consequence, le concept general , i.e. des materiaux nucleiques derives d'un 
extrait de plasma de patients atteints de la sclerose en plaques (SEP) et derives d'un 
retrovirus MSRV n'est pas nouveau au vu de I'art anterieur (Article 33 (2) PCT). 

Ainsi, compte tenu de I'art anterieur, la demande de brevet presente un defaut 
d'unite, des lors qu'il est considere que les diverses inventions ou pluralites 
d'inventions suivantes ne sont pas reliees entre elles de telle sorte qu'elles ne 
forment qu'un seul concept inventif general. 

Invention 1 : Revendications 5 ten entier) et 1-3, 13-19 et 22-28 ( partiellement ) 
concernant un materiel nucleique comprenant une sequence nucleotidique SEQ ID 
NO: 1 12, ou codant pour un polypeptide avec une sequence peptidique SEQ ID NO: 
1 13, les peptides correspondents, des compositions de diagnostic, prophylactiques, 
ou therapeutiques, comprenant un fragment des dites sequences et leur utilisation 
dans un procede pour detecter un retrovirus associe a la sclerose en plaques. 



Formulaire PCT/Feuille s<*par6e/409 (feuille 2) (OEB-avril 1997) 



j 



RAPPORT D'EXAMEN Demande Internationale n° PCT/FR98/01460 
PRELIMINAIRE INTERNATIONAL - FEUILLE SEPAREE 



Invention 2 : Revendications 7-10 et 12 (en entier) et 1, 2, 11 et 13-28 

( partiellement) concernant un materiel nucleique comprenant une sequence 
nucleotidique SEQ ID NO: 1 14, SEQ ID NO: 1 1 7, ou SEQ ID NO: 120 ou codant pour 
un polypeptide avec une sequence peptidique SEQ ID NO: 115, SEQ ID NO: 118, 
ou SEQ ID NO: 121, les peptides correspondants, des compositions de diagnostic, 
prophylactiques, ou therapeutiques, comprenant un fragment des dites sequences 
et leur utilisation dans un procede pour detecter un retrovirus associe a la sclerose 
en plaques. 

Invention 3 : Revendications 1, 3, 13-15, 18, 19, 21-25, 27 et 28 (partiellement) 
concernant un materiel nucleique comprenant une sequence nucleotidique SEQ ID 
NO: 124, les peptides correspondants, des compositions de diagnostic, 
prophylactiques, ou therapeutiques, comprenant un fragment des dites sequences 
et leur utilisation dans un procede pour detecter un retrovirus associe a la sclerose 
en plaques. 

Invention 4 : Revendications 4 et 6 (en entier) et 1, 2, 13-19 et 21-28 

( partiellement) concernant un materiel nucleique comprenant une sequence 
nucleotidique SEQ ID NO: 130 ou codant pour un polypeptide avec une sequence 
peptidique SEQ ID NO: 1 35 ou SEQ ID NO: 137, les peptides correspondants, des 
compositions de diagnostic, prophylactiques, ou therapeutiques, comprenant un 
fragment des dites sequences et leur utilisation dans un procede pour detecter un 
retrovirus associe a la sclerose en plaques. 

Invention 5 : Revendications 1,11,13-15, 18-20, 22-25, 27 et 28 (partiellement) 
concernant un materiel nucleique comprenant une sequence nucleotidique SEQ ID 
NO: 141 ou SEQ ID NO: 142, les peptides correspondants, des compositions de 
diagnostic, prophylactiques, ou therapeutiques, comprenant un fragment des dites 
sequences et leur utilisation dans un procede pour detecter un retrovirus associe a 
la sclerose en plaques. 

3.4. En reponse a I'invitation a limiter les revendications ou a payer des taxes 
additionnelles, datee du 18.06.99, la Demanderesse a choisi de ne pas payer de 
taxes additionnelles et a done limite la demande a Y lnvention 2 . Revendications 7- 
10 et 12 ( en entier) et 1, 2, 11 et 13-28 (partiellement). 
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En consequence, le present Rapport d'Examen Preliminaire International est done 
limite a I'objet des revendications 7-10 et 12 (en entier) et 1, 2, 11 et 13-28 
( partiellement) . 

SECTION V 

4. II est fait reference aux documents suivants: 

D1: FR-A-2 737 500 
D2: WO-A-95 21256 

5. Activite Inventive (Article 33(3) PCT) 

La presente demande ne satisfait pas aux criteres enonces a I'Article 33 (3) PCT, 
I'objet de Y lnvention 2 . Revendications 7-10 et 12 (en entier) et 1, 2, 11 et 13-28 
( partiellement) n'impliquant pas d'activite inventive (Regie 65 (1) et (2) PCT). 

L'etat de la technique le plus proche pour juger de I'activite inventive de ylnvention 
2 est considere comme etant represents par le document D1 ou D2. 
Les documents D1 et D2, chacun pris separement, decrivent du materiel viral, a l'etat 
isole ou purifie, et des fragments nucleotidiques, notamment des sequences du gene 
env du genome retroviral MSRV-1, associes a la sclerose en plaques. Ces 
documents decrivent aussi des sondes nucleiques de detection qui s'hybridet auxdits 
lesdits fragments, des amorces comprenant tout ou partie desdits fragments et des 
peptides codes par lesdits fragments. En outre, ces documents decrivent I'utilisation 
de ces sequences a des fins de diagnostic, prophylactiques et therapeutiques. 

(Voir D1: Abrege, page 2, ligne 31 -page 8, ligne 5, exemples 7-9 et revendications 
et D2: Abrege, page 3, ligne 5-page 17, ligne 14 et revendications). 

La difference entre I'obiet revendiaue et celui des dn numents D1 ou D2 reside dans 
les sequences specifiques derivees d'un retrovirus MSRV (Multiple Sclerosis (MS) 
associated Retrovirus (RV)). 
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Au vu du document D1 ou D2, le probleme technique a resoudre par I' Invention 2 est 
celui de fournir des sequences nucleotidiques et peptidiques supplementaires, derivees 
du MSRV. 

II est de noter que le probleme a resoudre n'est pas celui de deduire la sequence 
complete et exacte du genome repficatif du retrovirus. 

La solution apportee par le Demanderesse reside dans un materiel nucleique 
comprenant une sequence nucleotidique SEQ ID NO:1 14, SEQ ID NO:1 17 ou SEQ ID 
NO: 120 et codant, respectivement, pour un polypeptide presentant les sequences 
peptidiques SEQ ID NO:115, SEQ ID NO:1 18 ou SEQ ID NO:121. 

En outre, les documents D1 et D2 indiquent que le MSRV, un virus qui contient de 
TARN, a une variability consecutive notamment a des taux relativement eleves de 
mutation spontanee. Cela implique done que le MSRV comprend de multiples souches 
ou sous-families. (Voir D1: page 8, lignes 16-25 et D2: page 15, lignes 23-32). 

L'homme du metier n'aurait done besoin d'aucune activite inventive pour utiliser les 
sequences decrites dans D1 et D2, qui sont deja associees aux genomes 
correpondants, pour produire des amorces et/ou des sondes nucleiques pour la 
detection d'un agent pathogene et/ou infectant associe a la sclerose en plaques. Cela 
permettrait done d'isoler des sequences supplementaires derivees du MSRV, qui 
permettraient de caracteriser d'autres regions du genome de MSRV, ou des variants de 
MSRV. 

Les sequences de Vlnvention 2 servent uniquement a une caracterisation additionnelle 
du systeme retroviral implique dans la SEP (sclerose en plaques). De ce fait, les 
sequences de la presente demande sont des solutions alternatives pour un probleme 
qui avait deja ete resolu de maniere similaire dans Tart anterieur (D1 et D2). 
Ces sequences sont considerees comme relevant d'une selection arbitraire entre 
plusieurs sequences derivees du MSRV disponibles, et qui n'implique done a priori 
aucune activite inventive. 
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Une telle selection releve d'une activite inventive uniquement au cas ou elle produit un 
effet inattendu/surprenant non envisage par I'homme du metier. Un tel effet n'a pas ete 
demontre pour les sequences de ^Invention 2. 

L'objet des revendications 7-10 et 12 (en entier) et 1, 2, 11 et 13-26 (partiellement) 
n'implique done pas d'activite inventive. 

Des lors que les documents D1 et D2 decrivent I'utilisation de sequences derivees du 
MSRV associe a la sclerose en plaques a des fins de diagnostic, prophylactiques et 
therapeutiques, ainsi que pour la detection de retrovirus associe a la sclerose en 
plaques, I'objet des revendications 27 et 28 decoule de maniere evidente pour I'homme 
du metier. 

En conclusion, au regard des documents cites, la matiere de ^ Invention 2 . 
Revendications 7-10 et 12 (en entier) et 1, 2, 11 et 13-28 (partiellement) , n'implique 
pas d'activite inventive. 

6. Application Industrielle (Article 33(4) PCT) 

L'objet de V lnvention 2 . Revendications 7-10 et 12 (en entier) et 1, 2, 11 et 13-28 

( partiellement) est susceptible d'application industrielle (Article 33 (4) PCT). 

SECTION VI 

7. Reference au document de brevet suivant a ete faite selon la Regie 64.3 PCT et celui-ci 
est des lors cite selon la Regie 70.10 PCT: 

WO-A-98 23755, publie le 04.06.98, depose le 26.1 1.97 et ayant la date de priorite 
du 26.11.96. 
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SECTION VIII 

8.1. La presente demande ne remplit pas les conditions enoncees a I'Article 6 PCT, les 
revendications suivantes n'etant pas claires. 

1 . Les expressions " sequences equivalentes " (revendications 1, 7, 14, 15, 20 et 24) 
et " sequences identiques partiellement " (revendication 24) rendent ces 
revendications pas claires. Ces expressions sont vagues et imprecises dans la 
mesure ou elles n'indiquent pas le degre d'equivalence ou d'identite, ni ne donnent 
aucune indication concernant les caracteristiques structurelles ou fonctionnelles 
de ces sequences equivalentes et/ou identiques partiellement. Ces expressions 
permettent done toute interpretation individuelle. 

2. L'objet de la revendication 18 manque de clarte du aux termes " hvbrider " et 
" sonde ". Ces termes sont vagues et imprecis, des lors qu'ils n'indiquent pas 
respectivement les conditions d'hybridation et la taille de la sonde. Ce fait rend ces 
termes completement ouvert a toute interpretation individuelle. 

3. Le terme" partie de " rend la revendication 20 pas claire. 

Ce terme est vague et imprecis, dans la mesure ou il n'indique pas la taille de cette 
partie, la region de la sequence nucleotidique a laquelle cette partie correspond, 
ni la fonction de cette partie. 

Bien que la signification des termes et expressions ci-dessus mentionnes soit indiquee 
dans la description (pages 11-15), la signification d'une revendication doit se degager 
des termes memes de la revendication (voir Directives pour PCT, Clll 4.2). 

8.2. En ce qui concerne la description, la chose suivante devrait aussi etre consideree: 

- page 21, ligne 18: il semble que I'expression " figure 27 " n'est pas correcte 

- page 28, ligne 12: le numero des SEQ ID des amorces manque. 
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REVENDICATIONS 

5 1. Materiel nucleique, a l f etat isole ou purifie, 

comprenant une sequence nucleotidique choisie dans le 
groupe* qui consiste en (i) les sequences SEQ ID NO: 112, 
•SEQ ID NO: 114, SEQ ID NO: 117, SEQ I'D NO: 12 0, 

SEQ ID NO: 124, SEQ ID NO: 130, SEQ ID NO: 141 et 

10 SEQ ID NO: 142 ; (ii) les sequences complementaires des 
sequences (i) ; et (iii) les sequences equivalentes aux 
sequences (i) ou (ii) , en particulier les sequences 
presentant pour toute suite de 100 monomeres contigus, au 
moins 50 %, et pref erentiellement au moins 70 % 

15 d'homologie avec respectivement les sequences (i) ou (ii) , 
ledit materiel nucleique etant different de la sequence 
SEQ ID NO: HSAC64 du clone humain RG083M05 accessible sur 
la base de donnee EMBLHTG. 

2. Materiel nucleique, a l'etat isole ou purifie, 

2 0 codant pour un polypeptide presentant, pour toute suite 

contigue d'au moins 3 0 acides amines, au moins 50 %, et de 
preference au moins 70 % d^omologie, avec une sequence 
peptidique choisie dans le groupe qui consiste en 
SEQ ID NO: 113, SEQ ID N°115, SEQ ID NO: 118, 

25 SEQ ID NO: 121, SEQ ID NO: 13 5 et SEQ ID NO: 13 7, ledit 
materiel nucleique etant different de la sequence 
SEQ ID NO: HSAC64 du clone humain RG08 3M05 accessible sur 
la base de donnee EMBLHTG. 

3. Materiel nucl&ique retroviral, dont le gene pol 

3 0 comprend une sequence nucleotidique identique ou 

6quivalente a une sequence choisie dans le . groupe qui 
consiste en SEQ ID NO: 112, SEQ ID NO: 124 et _ leurs 
sequences complementaires . 

4. Materiel nucleique retroviral, dont l'extremite 
35 5 1 du gSne pol commence au nucleotide 1419 de 

SEQ ID NO: 130. 



FEUILLE MODIFiEE 



40 

5. Materiel nucleique retroviral , dont le gene pol 
code pour un polypeptide presentant, pour toute suite 
contigue d'au moins 3 0 acides amines, au moins 50 %, et de 
preference au moins 70 % d'homologie, avec la sequence 

5 peptidique SEQ ID NO: 113. 

6. Materiel nucleique retroviral, dont I'extremite 
3» du g&ne gag finit au nucleotide 1418 de SEQ ID NO: 130. 

7. Materiel nucleique retroviral, dont le- gene env 
comprend une sequence nucleotidique identique ou 

10 equivalente S une sequence choisie dans le groupe qui 
consiste en SEQ ID NO: 117, et ses sequences 
complementaires. 

8 . Materiel nucleique retroviral , dont le gene env 
comprend une sequence nucleotidique qui commence au 

15 nucleotide 1 de SEQ ID NO: 117 et finit au nucleotide au 
nucleotide 23 3 de SEQ ID NO: 114. 

9. Materiel nucleique retroviral, dont le gene env 
code pour un polypeptide presentant, pour toute suite 
contigue d'au moins 30 acides amines, au moins 50 %, et de 

20 preference au moins 70 % d'homologie, avec la sequence 
SEQ ID NO: °118. 

10. Materiel nucleique retroviral dont la region 
U3R du LTR 3 9 comprend une sequence nucleotidique qui se 
termine au nucleotide 617 de SEQ ID NO: 114. 

25 11. Materiel nucleique retroviral dont la region 

RU5 du LTR 5 » comprend une sequence nucleotidique qui 
commence au nucleotide 7 55 de SEQ ID NO: 12 0 et finit au 
nucleotide 3 37 de SEQ ID NO: 141 ou SEQ ID NO: 142. 

12. Materiel nucleique retroviral comprenant une 
3 0 sequence qui commence au nucleotide 755 de SEQ ID NO: 120 

et qui se termine au nucleotide 617 de SEQ ID NO: 114. 

13. Materiel nucleique retroviral selon l f une 
quelconque des revendications precedentes, caractSrise en 
ce qu'il est associe a au moins une maladie auto-immune 

35 telle que la sclerose en plaques ou la polyarthrite 
rhumatoxde. 
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% 14. Fragment nucleotidique comprenant une sequence 
nucleotidique choisie dans le groupe qui consiste en (i) 
les sequences SEQ ID NO: 112, SEQ ID NO: 114, 

SEQ ID NO: 117, SEQ ID NO: 120, SEQ ID NO: 124, 

5 SEQ ID NO: 13 0, SEQ ID NO: 141 et SEQ ID NO: 142 ; (ii) 
les sequences complementaires des sequences (i) ; et (iii) 
les sequences equivalentes aux sequences (i) ou (ii) , en 
particulier les sequences presentant pour toute suite de 
100 monomeres contigus, au moins 50 %, et 

10 pref §rentiellement au moins 70 % d'homologie avec 
respectivement les -sequences (i) ou (ii) , ledit fragment 
nucleotidique etant different de la sequence 
SEQ ID NO: HSAC64 du clone humain RG083M05 accessible sur 
la base de donnee EMBLHTG. 

15 15. Fragment nucleotidique selon la revendication 

14, consistant en une sequence nucleotidique choisie dans 
le groupe qui consiste en (i) les sequences 
SEQ ID NO: 112, SEQ ID NO: 114, SEQ ID NO: 117, 

SEQ ID NO: 120, SEQ ID NO: 124, SEQ ID NO: 130, 

2 0 SEQ ID NO: 141 et SEQ ID NO: 142 ; (ii) les sequences 
complementaires des sequences (i) ; et (iii) les sequences 
equivalentes aux sequences (i) ou (ii) , en particulier les 
sequences presentant pour toute suite de 100 monomeres 
contigus, au moins 50 %, et pref erentiellement au moins 

25 70 % d'homologie avec respectivement les sequences (i) ou 
(ii) . 

16. Fragment nucleotidique comprenant une sequence 
nucleotidique codant pour un polypeptide presentant, pour 
toute suite contigue d'au moins 3 0 acides amines, au moins 

30 50 %, et de preference au moins 70 % d'homologie, avec une 
sequence peptidique choisie dans le groupe qui consiste en 
SEQ ID NO: 113, SEQ ID N°115, SEQ ID NO: 118, 

SEQ ID NO: 121, SEQ ID NO: 13 5 et SEQ ID NO: 137, ledit 
fragment nucleotidique etant different de la sequence 

35 SEQ ID NO: HSAC64 du clone humain RG083M05 accessible sur 
la base de donnee EMBLHTG. 
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17 . Fragment nucleotidique selon la revendication 
16, consistant en une sequence nucleotidique codant pour 
un polypeptide prSsentant, pour toute suite contigue d'au 
moins 30 acides amines, au moins 50 %, et de preference au 

5 moins 70 % d'homologie, avec une sequence peptidique 
choisie dans le groupe qui consiste en SEQ ID NO: 113, 
SEQ ID N°115, SEQ ID NO: 118, SEQ ID NO: 121, 

SEQ ID NO: 135 et SEQ ID NO: 137. 

18. Sonde nucleique pour la detection d'un 
10 retrovirus associe a la sclerose en plaques et/ou la 

polyarthrite rhumatoxde, caracterisee en ce qu'elle est 
susceptible de s'hybrider specif iquement sur tout fragment 
selon l'une quelconque des revendications 14 & 17, 
appartenant au genome dudit retrovirus. 
15 19. Sonde selon la revendication 18, caracterisee 

en ce qu'elle possede de 10 a 100 nucleotides, de 
preference de 10 a 30 nucleotides. 

20. Amorce pour 1 ' amplification par polymerisation 
d'un ARN ou d'un ADN d'un retrovirus associe a la sclerose 

2 0 en plaques et/ou la polyarthrite rhumatoide, caracterisee 

en ce qu'elle comprend une sequence nucleotidique 
identique ou equivalente a au moins une partie de la 
sequence nucleotidique d'un fragment selon l'une 
quelconque des revendications 14 a 17, notamment une 
25 sequence nucleotidique presentant pour toute suite de 10 
monomeres contigus, au moins 50 %, de preference au moins 
70 % d'homologie avec au moins ladite partie dudit 
fragment . 

21. Amorce selon la revendication 20, caracterisee 

3 0 en ce que sa sequence nucleotidique est choisie parmi 

SEQ ID NO: 116, SEQ ID NO: 119, SEQ ID NO: 122, 

SEQ ID NO: 123, SEQ ID NO: 126, SEQ ID NO: 127, 

SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 132, et 

SEQ ID NO: 133. 

35 22. ARN comprenant un fragment genomique du 

materiel nucleique selon l'une quelconque des 
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revendications 1 a 7 ou un fragment selon l'une quelconque 
des revendications 14 a 17 . 

23. ADN comprenant un fragment genomique du 
materiel nucleique selon l'une quelconque des 

5 revendications 1 a 7 ou un fragment selon l'une quelconque 
des revendications 14 a 17. 

24. Vecteur de replication et/ou d ' expression, 
comprenant un fragment genomique du materiel nucleique 
selon l'une quelconque des revendications l a 7 ou un 

10 fragment selon l'une quelconque des revendications 14 Sl 
17. 

25. Peptide code par tout cadre de lecture ouvert 
appartenant a un fragment nucleotidique selon l'une 
quelconque des revendications 14 a 17, notamment 

15 polypeptide, par exemple oligopeptide formant ou 
comprenant un determinant antigenique reconnu par des sera 
de patients infectes par le virus MSRV-1, et/ou chez 
lesquels le virus MSRV-1 a ete reactive. 

26. Peptide selon la revendication 25 comprenant 

2 0 une sequence identique, partiellement ou totalement, ou 

equivalente a une sequence choisie parmi SEQ ID NO: 113, 
SEQ ID N°115, SEQ ID NO: 118, SEQ ID NO: 121, 

SEQ ID NO: 135 et SEQ ID NO: 137. 

27. Composition diagnostique, prophylactique, ou 
25 therapeutique, notamment pour inhiber 1' expression d'au 

moins un retrovirus associe a la sclerose en plaques et/ou 
a la polyarthrite rhumatoide, comprenant un fragment 
nucleotidique selon l'une quelconque des revendications 14 
a 17. 

3 0 28. Procede pour detecter un retrovirus associe a 

la sclerose en plaques et/ou a la polyarthrite rhumatoide, 
dans un echantillon biologique, caracterise en ce que l'on 
met en contact un ARN et/ou un ADN presume appartenir ou 
provenant dudit retrovirus, ou leur ARN et/ou ADN 
35 complementaire, avec une composition comprenant un 
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WO 99/02666 - 1 - PCT/FR98/01460 

RETROVIRAL NUCLEIC MATERIAL AND NUCLEOTIDE FRAGMENTS , 
IN PARTICULAR ASSOCIATED WITH MULTIPLE SCLEROSIS AND/OR 
RHEUMATOID ARTHRITIS, FOR DIAGNOSTIC, PROPHYLACTIC AND 

THERAPEUTIC USES 

5 

Multiple sclerosis (MS) is a demyelinizing 
disease of the central nervous system (CNS) of which 
the complete cause still remains unknown. 

Numerous studies have supported the hypothesis 
10 for a viral etiology of the disease, but none of the 
known viruses tested has proved to be the causative 
agent tested for: a review of the viruses tested for in 
MS for many years has been carried out by E. Norrby and 
R . T . Johnson . 

15 Recently, a retrovirus, different from the 

known human retroviruses, was isolated from patients 
suffering from MS. The authors were able to show that 
this retrovirus could be transmitted in vitro, that 
patients suffering from MS produced antibodies capable 

20 of recognizing proteins associated with the infection 
of the leptomeningeal cells by this retrovirus, and 
that the expression of the latter could be greatly 
stimulated by the immediate-early genes of some 
herpesviruses . 

2 5 All these results argue in favor of the role in 

MS of at least one unknown retrovirus or of a virus 
having a reverse transcriptase (RT) activity which is 
detectable by the method published by H. Perron and 
termed w LM7-type RT" activity. 

3 0 The studies by the applicant have made it 

possible to obtain two continuous cell lines infected 
with natural isolates obtained from two different 
patients suffering from MS, by a culture method as 
described in the document WO-A-93 20188,1 whose content 
3 5 is incorporated by reference into the present 
description. These two lines derived from cells of 
human choroid plexus, called LM7PC and PLI-2, were 
deposited at the E.C.A.C.C. on 22 July 1992 and 
8 January 1993, respectively, under numbers 92 072201 
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and 93 010817, in accordance with the provisions of the 
Treaty of Budapest. Moreover, the viral isolates 
possessing an LM7-type RT activity have also been 
deposited at the E.C.A.C.C. under the overall name of 
"strains". The "strain" or isolate harbored by the 
PLI-2 line, called POL-2, was deposited at the 
E.C.A.C.C. on 22 July 1992 under No. V92072202. The 
"strain" or isolate harbored by the LM7PC line, called 
MS7PG, was deposited at the E.C.A.C.C. on 8 January 
1993 under No. V93010816. 

Using the abovementioned cultures and isolates, 
characterized by biological and morphological criteria, 
efforts were then made to characterize the genetic 
material associated with the viral particles produced 
in these cultures. 

The proportions of genome already characterized 
were used to develop molecular detection tests for the 
viral genome and immunoserological tests, using the 
amino acid sequences encoded by the nucleotide 
sequences of the viral genome, in order to detect the 
immune response directed against epitopes associated 
with the viral infection and/or expression. 

These tools have already made it possible to 
confirm an association between MS and the expression of 
the sequences identified in the patents cited further 
on. However, the viral system discovered by the 
applicant is related to a complex retroviral system. 
Indeed, the sequences which are found to be 
encapsidated in the extracellular viral particles 
produced by the different cultures of cells of patients 
suffering from MS show clearly that there is co- 
encapsidation of retroviral genomes which are related 
but different from the "wild-type" retroviral genome 
which produces the infectious viral particles. This 
phenomenon was observed between replicative 
retroviruses and endogenous retroviruses belonging to 
the same family, or even heterologous retroviruses. The 
concept of endogenous retrovirus is very important in 
the context of our discovery because, in the case of 
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MSRV-1, it has been observed that endogenous retroviral 
sequences comprising sequences homologous to the MSRV-1 
genome exist in normal human DNA. The existence of 
endogenous retroviral elements (ERV) related to MSRV-1 
5 through all or part of their genome explains the fact 
that the expression of the MSRV-1 retrovirus in human 
cells can interact with related endogenous sequences. 
These interactions are found in the case of pathogenic 
and/or infectious endogenous retroviruses (for example 

10 some ecotropic strains of the Murine Leukemia virus) , 
in the case of exogenous retroviruses whose nucleotide 
sequence may be found partially or completely in the 
form of ERVs, in the genome of the host animal (e.g. 
mouse mammary tumor exogenous virus transmitted via 

15 milk) . These interactions consist mainly of (i) a 
transactivation or co-activation of ERVs by the 
replicative retrovirus, (ii) an "illegitimate" 
encapsidation of related RNAs of ERVs, or of ERVs - or 
even of cellular RNAs - simply possessing compatible 

20 encapsidation sequences, into the retroviral particles 
produced by the expression of the replicative strain, 
which are sometimes transmissible and sometimes with an 
inherent pathogenicity, and ( iii) relatively high 
recombinations between the co-encapsidated genomes , in 

25 particular in the reverse transcription phases, which 
lead to the formation of hybrid genomes, which are 
sometimes transmissible and sometimes with an inherent 
pathogenicity. 

Thus, (i) various MSRV-l-related sequences have 

30 been found in purified viral particles; (ii) molecular 
analysis of the various regions of the MSRV-1 
retroviral genome should be carried out by 
systematically analyzing the co-encapsidated, 

interfering and/or recombinant sequences which are 

3 5 generated by the infection and/or expression of MSRV-1; 
furthermore, some clones may have portions of defective 
sequences produced by the retroviral replication and 
the template and/or transcription errors caused by 
reverse transcriptase; ( iii ) the families of sequences 
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related to the same retroviral genomic region are the 
supports for an overall diagnostic detection which may 
be optimized by the identification of invariable 
regions among the clones expressed and by the 
identification of reading frames responsible for the 
production of antigenic and/or pathogenic polypeptides 
which may only be produced by a portion, or even only 
one, of the clones expressed and under these 
conditions, the systematic analysis of the clones 
expressed in one region of a given gene makes it 
possible to evaluate the frequency of variation and/or 
recombination of the MSRV-1 genome in this region and 
to define the optimum sequences for the applications, 
in particular the diagnostic applications; (iv) the 
pathology caused by a retrovirus such as MRSV-1 may be 
a direct effect of its expression and of the proteins 
or peptides produced as a result, but also an effect of 
the activation, encapsidation, recombination of related 
or heterologous genomes and proteins or peptides 
produced as a result; thus, these genomes associated 
with the expression and/or infection by MSRV-1 are an 
integral part of the potential pathogenicity of this 
virus and therefore constitute diagnostic detection 
supports and particular therapeutic targets. Likewise, 
any agent which is associated with, or which is a 
cofactor for these interactions responsible for the 
pathogenicity in question, such as MSRV-2 or the 
gliotoxic factor described in the patent application 
published under the No. FR-2, 716, 1981 can participate 
in the development of an overall and very effective 
strategy for therapeutic diagnosis, prognosis, 
monitoring and/or integrated therapy for MS in 
particular, but also for any other disease associated 
with the same agents. 

In this context, a parallel discovery has been 
made in another autoimmune disease, rheumatoid 
arthritis (RA) , which has been described in the French4 
patent application filed under the No . 95 02960* This 
discovery shows that, by applying methodological 
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approaches similar to those which were used in the 
studies by the applicant on MS, it has been possible to 
identify a retrovirus expressed in RA which shares the 
sequences described for MSRV-1 in MS and also the co- 
5 existence of an MSRV-2 -associated sequence which is 
also described in MS. As regards MSRV-1, the sequences 
commonly detected in MS and RA relate to the pol and 
gag genes. On the basis of current knowledge, it is 
possible to combine the gag and pol sequences described 
10 with the MSRV-1 strains expressed in these two 
diseases . 

The present patent application has as its 
object various results, supplementary in relation to 
those already protected by the French patent 
15 applications: 
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- Patent Application WO 

The present invention relates, first of all, to 
a nucleic material, which may consist of a retroviral 
material, in isolated or purified state, which may be 
understood or characterized in various ways: 
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- it comprises a nucleotide sequence chosen 
from the group which consists of (i) the sequences 

SEQ ID NO: 112, SEQ ID NO: 114, SEQ ID NO: 117, 

SEQ ID NO: 120, SEQ ID NO: 124, SEQ ID NO: 130, 

5 SEQ ID NO: 141 and SEQ ID NO: 142; (ii) the sequences 
complementary to sequences (i); and (iii) the sequences 
equivalent to sequences (i) or (ii) , in particular the 
sequences having, for every series of 100 contiguous 
monomers, at least 50%, and preferentially at least 70% 
10 homology with sequences (i) or (ii) respectively; 

- it encodes a polypeptide having, for every 
contiguous series of at least 30 amino acids, at least 
50%, and preferably at least 70% homology with a 
peptide sequence chosen from the group which consists 

15 of SEQ ID NO: 113, SEQ ID NO: 115, SEQ ID NO: 118, 
SEQ ID NO: 121, SEQ ID NO: 135 and SEQ ID NO: 137; 

- its pol gene comprises a nucleotide sequence 
identical or equivalent to a sequence chosen from the 
group which consists of SEQ ID NO: 112, SEQ ID NO: 124 

2 0 and their complementary sequences; 

- the 5' end of its pol gene starts at 
nucleotide 1419 of SEQ ID NO: 130; 

- its pol gene encodes a polypeptide having, 
for every contiguous series of at least 30 amino acids, 

25 at least 50%, and preferably at least 70% homology with 
the peptide sequence SEQ ID NO: 113; 

- the 3' end of its gag gene ends at nucleotide 
1418 of SEQ ID NO: 130; 

- its env gene comprises a nucleotide sequence 

3 0 identical or equivalent to a sequence chosen from the 

group which consists of SEQ ID NO: 117, and its 
complementary sequences ; 

- its env gene comprises a nucleotide sequence 
which starts at nucleotide 1 of SEQ ID NO: 117 and ends 

35 at nucleotide at nucleotide [sic] 233 of SEQ ID NO: 
114; 

- its env gene encodes a polypeptide having, 
for every contiguous series of at least 30 amino acids, 



at least 50%, and preferably at least 70% homology with 
the sequence SEQ ID NO: 118; 

- the U3R region of its 3' LTR comprises a 
nucleotide sequence which ends at nucleotide 617 of 
SEQ ID NO: 114; 

- the RU5 region of its 5' LTR comprises a 
nucleotide sequence which starts at nucleotide 755 of 
SEQ ID NO: 120 and ends at nucleotide 337 of SEQ ID NO: 
141 or SEQ ID NO: 142; 

- a retroviral nucleic material comprising a 
sequence which starts at nucleotide 755 of SEQ ID NO: 
120 and which ends at nucleotide 617 of SEQ ID NO: 114; 

- the retroviral nucleic material as defined 
above is in particular associated with at least one 
autoimmune disease such as multiple sclerosis or 
rheumatoid arthritis. 

The invention also relates to a nucleotide 
fragment which corresponds to at least one of the 
following definitions: 

- it comprises or consists of a nucleotide 
sequence chosen from the group which consists of (i) 
the sequences SEQ ID NO: 112, SEQ ID NO: 114, 
SEQ ID NO: 117, SEQ ID NO: 12 0, SEQ ID NO : 12 4, 
SEQ ID NO: 13 0, SEQ ID NO: 141 and SEQ ID NO: 142; (ii) 
the sequences complementary to sequences (i); and (iii) 
the sequences equivalent to sequences (i) or (ii), in 
particular the sequences having, for every series of 
100 contiguous monomers, at least 50%, and 
preferentially at least 70% homology with sequences (i) 
or (ii) respectively; 

- it comprises or consists of a nucleotide 
sequence encoding a polypeptide having, for every 
contiguous series of at least 30 amino acids, at least 
50%, and preferably at least 70% homology with a 
peptide sequence chosen from the group which consists 
of SEQ ID NO: 113, SEQ ID NO: 115, SEQ ID NO: 118, 
SEQ ID NO: 121, SEQ ID NO: 135 and SEQ ID NO: 137. 

Other subjects of the present invention are the 
following : 
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- a nucleic probe for the detection of a 
retrovirus associated with multiple sclerosis and/ or 
rheumatoid arthritis, capable of hybridizing 
specifically with any fragment defined above and 

5 belonging to the genome of said retrovirus; it 
advantageously possesses from 10 to 100 nucleotides, 
preferably from 10 to 3 0 nucleotides; 

- a primer for the amplification, by 
polymerization, of an RNA or of a DNA of a retrovirus 

10 associated with multiple sclerosis and/or rheumatoid 
arthritis, which comprises a nucleotide sequence 
identical or equivalent to at least a portion of the 
nucleotide sequence of a fragment defined above, in 
particular a nucleotide sequence having, for every 

15 series of 10 contiguous monomers, at least 50%, 
preferably at least 70% homology with at least said 
portion of said fragment; preferably the nucleotide 
sequence of a primer of the invention is chosen from 
SEQ ID NO: 116, SEQ ID NO: 119, SEQ ID NO: 122, 

20 SEQ ID NO: 123, SEQ ID NO: 126, SEQ ID NO: 127, 

SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 132, and 
SEQ ID NO: 13 3 ; 

- an RNA or a DNA, and in particular a 
replication and/or expression vector, comprising a 

25 genomic fragment of the nucleic material or a fragment 
defined above? 

- a peptide encoded by any open reading frame 
belonging to a nucleotide fragment defined above, in 
particular a polypeptide, for example oligopeptide 

3 0 forming or comprising an antigenic determinant 
recognized by sera of patients infected with the MSRV-1 
virus, and/or in whom the MSRV-1 virus has been 
reactivated; a preferential peptide comprises a 
sequence identical, partially or completely, or 

3 5 equivalent to a sequence chosen from SEQ ID NO: 113, 
SEQ ID NO: 115, SEQ ID NO: 118, SEQ ID NO: 121, 

SEQ ID NO: 135 and SEQ ID NO: 137; 

- a diagnostic, prophylactic or therapeutic 
composition, in particular for inhibiting the 
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expression of at least one retrovirus associated with 
multiple sclerosis and/or rheumatoid arthritis, 
comprising a nucleotide fragment defined above; 

- a method for detecting a retrovirus 
5 associated with multiple sclerosis and/or rheumatoid 

arthritis , in a biological sample, comprising the steps 
consisting of bringing an RNA and/or a DNA assumed to 
belong to or obtained from said retrovirus, or their 
complementary RNA and/or DNA, into contact with a 
10 composition comprising a nucleotide fragment defined 
above . 

Before detailing the invention, various terms 
used in the description and the claims are now defined: 

- strain or isolate is understood to mean any 
15 infectious and/or pathogenic biological fraction 

containing, for example, viruses and/or bacteria and/or 
parasites, generating a pathogenic and/or antigenic 
power, harbored by a culture or a live host; by way of 
example, a viral strain according to the preceding 
20 definition may contain a co-infectious agent, for 
example a pathogenic protist, 

- the term "MSRV" used in the . present 
description designates any pathogenic and/or infectious 
agent, as associated with MS, in particular a viral 

2 5 species, the attenuated strains of said viral species, 

or the interfering defective particles or particles 
containing co-encapsidated genomes or alternatively 
genomes recombined with a portion of the MSRV-1 genome, 
which are derived from this species. It is known that 

3 0 viruses and particularly viruses containing RNA exhibit 

variability, following in particular relatively high 
rates of spontaneous mutation, which will be taken into 
account below to define the concept of equivalence, 

- human virus is understood to mean a virus 
3 5 capable of infecting or of being harbored by human 

beings , 

- given all the natural or induced variations 
and/ or recombination which may be encountered in 
practice in the present invention, the objects thereof, 




- 10 - 



defined above and in the claims, have been expressed by 
comprising the equivalents or derivatives of the 
various biological materials defined below, in 
particular homologous nucleotide or peptide sequences, 
5 - the variant of a virus or of a pathogenic 

and/or infectious agent according to the invention 
comprises at least one antigen recognized by at least 
one antibody directed against at least one 
corresponding antigen of said virus and/or of said 

10 pathogenic and/or infectious agent, and/or a genome in 
which any portion is detected by at least one 
hybridization probe, and/or at least one nucleotide 
amplification primer specific for said virus and/or 
pathogenic and/or infectious agent, under defined 

15 hybridization conditions well known to persons skilled 
in the art, 

- according to the invention, a nucleotide 
fragment or an oligonucleotide or a polynucleotide is a 
stretch of monomers, or a biopolymer, characterized by 

20 the informational sequence of the natural nucleic 
acids, which is capable of hybridizing to any other 
nucleotide fragment under predefined conditions, it 
being possible for the stretch to contain monomers of 
different chemical structures and to be obtained from a 

25 natural nucleic acid molecule and/or by genetic 
recombination and/or by chemical synthesis; a 
nucleotide fragment may be identical to a genomic 
fragment of the MSRV-1 virus considered by the present 
invention, in particular a gene of the latter, for 

3 0 example pol or env in the case of said virus; 

- thus, a monomer may be a natural nucleic acid 
nucleotide in which the constituent components are a 
sugar, a phosphate group and a nitrogen base; in RNA, 
the sugar is ribose; in DNA, the sugar is 2-deoxy- 

3 5 ribose; depending on whether DNA or RNA is involved, 
the nitrogen base is chosen from adenine, guanine, 
uracil, cytosine, thymine; or the nucleotide may be 
modified in at least one of the three constituent 
components; by way of example, the modification may 



occur at the level of the bases, generating modified 
bases such as inosine, 5 -methyl -deoxycyt idine , 
deoxyuridine, 5-dimethylamineodeoxyuridine [sic] , 

2 , 6-diamineopurine [sic], 5-bromodeoxyuridine and any 
other modified base promoting hybridization; at the 
level of the sugar, the modification may consist in the 
replacement of at least one deoxyribose with a 
polyamide, and at the level of the phosphate group, the 
modification may consist in its replacement with 
esters, in particular chosen from the esters of 
diphosphate, of alkyl and arylphosphonate and of 
phosphorothioate , 

- "informational sequence" is understood to 
mean any ordered series of monomers, whose chemical 
nature and in which the order in a reference direction, 
constitute or otherwise a functional information of the 
same quality as that for the natural nucleic acids, 

- hybridization is understood to mean the 
process during which, under appropriate operating 
conditions, two nucleotide fragments, having 
sufficiently complementary sequences, become annealed 
to form a complex, in particular a double or triple, 
structure, preferably in helical form, 

- a probe comprises a nucleotide fragment 
synthesized by the chemical route or obtained by 
digestion or enzymatic cleavage of a longer nucleotide 
fragment, comprising at least six monomers, 
advantageously from 10 to 100 monomers, preferably 10 
to 3 0 monomers, and possessing a hybridization 
specificity under defined conditions; preferably, a 
probe possessing less than 10 monomers is not used 
alone, but is used in the presence of other probes 
which are equally short in length or otherwise; under 
certain specific conditions, it may be useful to use 
probes which are greater than 100 monomers in size; a 
probe may be used in particular for diagnostic 
purposes, and it may be, for example, capture and/or 
detection probes, 
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- the capture probe may be immobilized on a 
solid support by any appropriate means, that is to say 
directly or indirectly, for example by covalent bonding 
or passive adsorption, 

5 - the detection probe may be labeled by means 

of a marker chosen in particular from radioactive 
isotopes, enzymes chosen in particular from peroxidase 
and alkaline phosphatase and those capable of 
hydrolyzing a chromogenic, fluorigenic or luminescent 
10 substrate, chromophoric chemical compounds, 

chromogenic, fluorigenic or luminescent compounds, 
analogs of nucleotide bases, and biotin, 

- the probes used for diagnostic purposes of 
the invention may be used in all known hybridization 

15 techniques, and in particular the so-called "DOT-BLOT" 
technique , " SOUTHERN BLOT" technique , "NORTHERN BLOT" 
technique which is a technique identical to the 
"SOUTHERN BLOT" technique but which uses RNA as target, 
the SANDWICH technique; advantagously , the SANDWICH 

20 technique is used in the present invention, comprising 
a specific capture probe and/or a specific detection 
probe, it being understood that the capture probe and 
the detection probe must have a nucleotide sequence 
which is at least partially different, 

25 - any probe according to the present invention 

may hybridize in vivo or in vitro with the RNA and/ or 
with the DNA, in order to block the replication, in 
particular translation and/or transcription, phenomena 
and/or to degrade said DNA and/ or RNA, 

30 - a primer is a probe comprising at least six 

monomers, and advantageously from 10 to 3 0 monomers, 
possessing hybridization specificity under defined 
conditions, for the initiation of an enzymatic 
polymerization, for example in an amplification 

35 technique such as PCR (Polymerase Chain Reaction), in 
an extension method such as sequencing, in a reverse 
transcription method and the like, 

- two nucleotide or peptide sequences are said 
to be equivalent or derived with respect to each other, 
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or with respect to a reference sequence, if 
functionally the corresponding biopolymers can play 
substantially the same role, without being identical, 
in relation to the application or use considered, or in 
the technique in which they are involved; particularly 
equivalent are two sequences obtained because of the 
natural variability, in particular spontaneous 
mutation, of the species from which they were 
identified, or induced mutation, as well as two 
homologous sequences,, the homology being defined below, 

- "variability" is understood to mean any 
spontaneous or induced modification of a sequence, in 
particular by substitution, and/or insertion, and/or 
deletion of nucleotides and/or of nucleotide fragments, 
and/or extension and/or shortening of the sequence at 
least at one of the ends; a nonnatural variability may 
result from the genetic engineering techniques used, 
for example from the choice of the degenerate or 
nondegenerate synthetic primers selected to amplify a 
nucleic acid; this variability may result in 
modifications of any starting sequence, considered as a 
reference, and which may be expressed by a degree of 
homology with respect to said reference sequence, 

- homology characterizes the degree of identity 
of two compared nucleotide or peptide fragments; it is 
measured by the percentage identity which is in 
particular determined by direct comparison of 
nucleotide or peptide sequences, with respect to 
reference nucleotide or peptide sequences, 

- any nucleotide fragment is said to be 
equivalent to or derived from a reference fragment if 
it has a nucleotide sequence equivalent to the sequence 
of the reference fragment; according to the preceding 
definition, in particular equivalent to a reference 
nucleotide fragment are : 

(a) any fragment capable of hybridizing, at 
least partially, with the complementary to the 
reference fragment, 
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(b) any fragment whose alignment with the 
reference fragment leads to the identification of 
identical contiguous bases, in a greater number than 
with any other fragment obtained from another taxonomic 

5 group, 

(c) any fragment resulting or capable of 
resulting from the natural variability of the species 
from which it is obtained, 

(d) any fragment which may result from genetic 
10 engineering techniques applied to the reference 

fragment , 

(e) any fragment, containing at least eight 
contiguous nucleotides, encoding a peptide homologous 
or identical to the peptide encoded by the reference 

15 fragment, 

(f) any fragment different from the reference 
fragment through insertion, deletion, substitution of 
at least one monomer, extension, or shortening at least 
at one of its ends; for example, any fragment 

2 0 corresponding to the reference fragment, flanked at 

least at one of its ends by a nucleotide sequence not 
encoding a polypeptide, 

- polypeptide is understood to mean in 
particular any peptide of at least two amino acids, in 
25 particular oligopeptide, protein, extracted, separated, 
or substantially isolated or synthesized, through the 
involvement of humans, in particular those obtained by 
chemical synthesis, or through expression in a 
recombinant organism, 

3 0 - polypeptide partially encoded by a nucleotide 

fragment is understood to mean a polypeptide having at 
least three amino acids encoded by at least nine 
contiguous monomers included in said nucleotide 
fragment , 

3 5 - an amino acid is said to be analogous to 

another amino acid when their respective 
physicochemical characteristics, such as polarity, 
hydrophobicity and/or basicity, and/or acidity, and/or 
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neutrality, are substantially the same; thus, a leucine 
is analogous to an isoleucine, 

- any polypeptide is said to be equivalent to 
or derived from a reference polypeptide if the 

5- polypeptides compared have substantially the same 
properties, and in particular the same antigenic, 
immunological, enzymatic and/or molecular recognition 
properties; in particular equivalent to a reference 
polypeptide is: 

10 (a) any polypeptide possessing a sequence in 

which at least one amino acid has been replaced by an 
analogous amino acid, 

(b) any polypeptide having an equivalent 
peptide sequence, obtained by natural or induced 

15 variation of said reference polypeptide, and/or of the 
nucleotide fragment encoding said polypeptide, 

(c) a mimotope of said reference polypeptide, 

(d) any polypeptide from whose sequence one or 
more amino acids of the L series are replaced by an 

20 amino acid of the D series, and vice versa, 

(e) any polypeptide into whose sequence a 
modification of the side chains of the amino acids has 
been introduced, such as for example an acetylation of 
the amine-containing functions, a carboxylation of the 

25 thiol functions, an esterif ication of the carboxyl 
functions , 

(f) any polypeptide in whose sequence one or 
more peptide bonds have been modified, such as for 
example the carba, retro, inverso, retro-inverso, 

3 0 reduced, and methylene-oxy bonds, 

(g) any polypeptide in which at least one 
antigen is recognized by an antibody directed against a 
reference polypeptide , 

- the percentage identity characterizing the 
3 5 homology between two peptide fragments compared is 

according to the present invention at least 50% and 
preferably at least 70%. 

Given that a virus possessing a reverse 
transcriptase enzymatic activity may be genetically 
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characterized both in RNA and DNA form, both the viral 
DNA and RNA will be mentioned in order to characterize 
the sequences relative to a virus possessing such a 
reverse transcriptase activity, termed MSRV-1 according 
to the present description. 

The expressions of order which are used in the 
present description and the claims, such as "first 
nucleotide sequence", are not selected to express a 
particular order, but to define the invention more 
clearly . 

Detection of a substance or agent is understood 
below to mean an identification, a quantification or a 
separation or isolation of said substance or of said 
agent . 

The invention will be understood more clearly 
on reading the detailed description which follows which 
is made with reference to the appended figures in 
which: 

Figure 1 represents the general structure of 
the proviral DNA and the genomic RNA of MSRV-1. 

Figure 2 represents the nucleotide sequence of 
the clone called CL6-5' (SEQ ID NO: 112) and three 
potential reading frames in amino acids presented under 
the nucleotide sequence. 

Figure 3 represents the nucleotide sequence of 
the clone called CL6-3 ' (SEQ ID NO: 114) and three 
potential reading frames in amino acids presented under 
the nucleotide sequence. 

Figure 4 represents the nucleotide sequence of 
the clone called C15 (SEQ ID NO: 117) and three 
potential reading frames in amino acids presented under 
the nucleotide sequence. 

Figure 5 represents the nucleotide sequence of 
the clone called 5M6 (SEQ ID NO: 12 0) and three 
potential reading frames in amino acids presented under 
the nucleotide sequence. 

Figure 6 represents the nucleotide sequence of 
the clone called CL2 (SEQ ID NO: 13 0) and three 



potential reading frames in amino acids presented under 
the nucleotide sequence. 

Figure 7 represents three potential reading 
frames in amino acids expressed by pET28C-clone 2 and 
presented under the nucleotide sequence. 

Figure 8 represents three potential reading 
frames in amino acids expressed by pET21C-clone 2 and 
presented under the nucleotide sequence. 

Figure 9 represents the nucleotide sequence of 
the clone called LB13 (SEQ ID NO: 141) and three 
potential reading frames in amino acids presented under 
the nucleotide sequence. 

Figure 10 represents the nucleotide sequence of 
the clone called LA15 (SEQ ID NO: 142) and three 
potential reading frames in amino acids presented under 
the nucleotide sequence. 

Figure 11 represents the nucleotide sequence of 
the clone called LB16 (SEQ ID NO: 124) and three 
potential reading frames in amino acids presented under 
the nucleotide sequence. 

Figure 12 represents the promoter activity 
expressed in cpm/4 min of the U3R sequences subcloned 
from LTRs of different origins into the plasmid PCAT3 . 
PC AT 3 means plasmid alone, PCAT-PH74 means plasmid plus 
endogenous U3R clone expressed in the placenta, PCAT- 
cl6 means plasmid plus U3R clone amplified in the RNA 
of an MS plasma, PCAT-5M6 means plasmid plus U3R region 
amplified in the cellular DNA, "no plasmid" means 
absence of plasmid in the test. 

Figure 13 represents the MSRV1 env and 3 ' LTR 
sequences. The horizontal arrows indicate the start of 
the env, U3 and R regions. In the env region, the 
signal peptide and the potential immunosuppressive 
region are underlined, the potential glycosilat ion 
sites are boxed and the potential cleavage sites are 
indicated by vertical arrows. In the U3R region: the 
regulatory element CAAT and the TATA Box are 
underlined, the "cap" site and the polyadenylation 
signal are also indicated. 



- 18 - 

Figure 14 represents the 5' LTR (RU5) region 
followed by a PBS site (primer binding site) 
complementary to the Trp tRNA and by a gag gene 
encoding a protein of about 487 amino acids. The amino 
acids conserved in the nucleocapsid are underlined 
twice. The amino acids defining the region of greatest 
homology in the capsid are in bold and underlined once. 
The / symbols in the ■ amino acid sequence indicate 
variations observed depending on the clones and, in the 
nucleotide sequence, they indicate frame jumps in some 
clones. The boxed regions correspond to epitopes 
identified by peptide analysis of the C-terminal 
region . 

Figure 15 represents the integrase region of 
MSRV1, the nucleotide sequence and the amino acid 
sequence deduced from the integrase region 
corresponding to clone 87-23. In Figure 15, // means a 
frame jump which has been suppressed in order to 
restore the potential ORF. The letters in underlined 
bold characters represent the conserved amino acids in 
the retroviral integrases. 

Figure 16 describes the nucleotide and peptide 
sequences of clone B13 (identical to clone FBdl3 
described in previous applications) with indication of 
the ORFs and stop codons represented by a dot. The 
underlined region in bold represents the potential 
immunosuppressive domain. The single underlined domain 
represents the start of the 3' LTR. 

EXMgLE^l^ PREPARATION OF A CL6-5' REGION ENCODING THE 
N-TERMINAL END OF INTEGRASE AND OF A CL6-3 ' REGION 
CONTAINING THE 3' TERMINAL SEQUENCE OF THE MSRV-1 
GENOME 

A 3 ' RACE was carried out on the total RNA 
extracted from plasma from a patient suffering from MS. 
A healthy control plasma, treated under the same 
conditions, was used as negative control. The synthesis 
of cDNA was carried out with an oligo dT primer 
identified by SEQ ID NO: 68 (5' GAC TCG CTG CAG ATC GAT 
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TTT TTT TTT TTT TTT T 3#) and the reverse transcriptase 
"Expand™ RT" from Boehringer according to the 
conditions recommended by the company. A PCR was 
carried out with the enzyme Klentaq (Clontech) under 
5 the following conditions: 94°C 5 min then 93°C 1 min, 
58°C 1 min, 68°C 3 min over 40 cycles and 68°C for 
8 min, with a final reaction volume of 50 |il . 
Primers used for the PCR: 

- 5' primer, identified by SEQ ID NO: 69 

10 5' GCC ATC AAG CCA CCC AAG AAC TCT TAA CTT 3'; 

- 3' primer, identified by SEQ ID NO: 68 

A second so-called "seminested" PCR was carried 
out with a 5' primer situated inside the region already 
amplified. This second PCR was carried out under the 
15 same experimental conditions as those used for the 
first PCR, using 10 |il of the amplification product 
derived from the first PCR . 
Primers used for the seminested PCR : 

- 5' primer, identified by SEQ ID NO: 70 

2 0 5 ' CCA ATA GCC AGA CCA TTA TAT ACA CTA ATT 3 ' ; 

- 3' primer, identified by SEQ ID NO: 68 

The primers SEQ ID NO: 69 and SEQ ID NO: 7 0 are 
specific for the pol region of MRSV-1. 

An amplification product of 1 . 9 Kb was obtained 
25 for the plasma of the MS patient. The corresponding 
fragment was not observed for the healthy control 
plasma. This amplification product was cloned in the 
following manner: 

The amplified DNA was inserted into a plasmid with the 

3 0 aid of the TA Cloning kit®. The 2 |il of DNA solution 

were mixed with 5 ^il of sterile distilled water, 1 |il 
of a 10 times concentrated ligation buffer "10X 
LIGATION BUFFER", 2 |il of "pCR™ VECTOR" (25 ng/ml) and 
1 |il of "T4 DNA LIGASE" . This mixture was incubated 
3 5 overnight at 12 °C. The next steps were carried out in 
accordance with the instructions for the TA Cloning 
kit® ( Invitrogen) . At the end of the procedure, the 
white colonies of recombinant bacteria (white) were 
subcultured so as to be cultured and allow the 
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extraction of the plasmids incorporated according to 
the so-called "miniprep" procedure. The plasmid 
preparation of each recombinant colony was cut with an 
appropriate restriction enzyme and analyzed on agarose 
5 gel. The plasmids possessing an insert detected under 
UV light after staining the gel with ethidium bromide 
were selected for the sequencing of the insert after 
hybridization with a primer complementary to the Sp6 
promoter present on the cloning plasmid of the TA 

10 cloning kit®. The reaction prior to the sequencing was 
then carried out according to the method recommended 
for using the sequencing kit "PRISM™ Ready Reaction 
AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied 
Biosystems, ref. 402119) and the automated sequencing 

15 was carried out on the Applied Biosystems 373 A and 377 
apparatus, according to the manufacturer's 

instructions . 

The clone obtained contains a CL6-5' region 
encoding the N-terminal end of integrase and a CL6-3' 

20 region corresponding to the 3' terminal region of 
MSRV-1 and making it possible to define the end of the 
envelope (234 bp) and the U3 and R (401 bp) regions of 
the MSRV1 retrovirus. 

The region corresponding to the N-terminal end 

25 of integrase is represented by its nucleotide sequence 
(SEQ ID NO: 112) in Figure 27. The three potential 
reading frames are presented by their amineo [sic] acid 
sequence under the nucleotide sequence, and the amineo 
[sic] acid sequence of the N-terminal end of integrase 

30 is identified by SEQID NO: 113. 

The C16-3' region is represented by its 
nucleotide sequence ( SEQ ID NO: 114) in Figure 3. The 
three potential reading frames are presented by their 
amineo [sic] acid sequence under the nucleotide 

35 sequence. An amineo [sic] acid sequence corresponding 
to the C- terminal end of the MSRV-1 env protein is 
identified by SEQ ID NO: 115. 

In order to evaluate the promoter activity of 
the LTR obtained from clone 6 (cl6) , a test of promoter 
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activity using the enzyme CAT (chloramphenicol acetyl 
transferase) was carried out with the corresponding U3R 
region. In parallel, a clone containing the same U3R 
region of endogenous retroviral RNA expressed in normal 
5 placenta (PH74) and a clone ( 5M6 ) obtained from DNA 
were tested. The result presented in Figure 12 shows a 
very high promoter activity of the LTR derived from MS 
plasma (cl6) and a significantly much lower activity 
with the sequences of non-MS endogenous origin. 

10 

EXAMPLE 2 : PREPARATION OF THE C15 CLONE CONTAINING THE 
REGION ENCODING A PORTION OF THE MSRV-1 RETROVIRUS 
ENVELOPE 

A RT-PCR was carried out on the total RNA 
15 extracted from virions concentrated by ultra- 
centrif ugation of a synoviocyte culture supernatant 
obtained from an MS patient. The synthesis of cDNA was 
carried out with an oligo dT primer and the reverse 
transcriptase "Expand™ RT" from Boehringer according to 

2 0 the conditions recommended by the company. A PCR was 

carried out with the Expand™ Long Template PCR System 
(Boehringer) under the following conditions: 94°C 5 min 
then 93°C 1 min, 60°C 1 min, 68°C 3 min over 40 cycles 
and 68°C for 8 min and with a final reaction volume of 
25 50 ill. 

Primers used for the PCR: 

- 5' primer, identified by SEQ ID NO: 6 9 

5 ' GCC ATC AAG CCA CCC AAG AAC TCT TAA CTT 3 ' ; 

- 3' primer, identified by SEQ ID NO: 116 

3 0 5 ' TGG GGT TCC ATT TGT AAG ACC ATC TGT AGC TT 3 ' 

A second so-called "seminested" PCR was carried 
out with a 5' primer situated inside the region already 
amplified. This second PCR was carried out under the 
same experimental conditions as those used for the 
3 5 first PCR (except that 3 0 cycles were used instead of 
40) , using 10 \il of the amplification product derived 
from the first PCR. 

Primers used for the seminested PCR: 

- 5' primer, identified by SEQ ID NO: 70 
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5 ' CCA ATA GCC AGA CCA TTA TAT ACA CTA ATT 3 ' ; 
- 3' primer, identified by SEQ ID NO: 116 

The primers SEQ ID NO: 69 and SEQ ID NO: 70 are 
specific for the pol region of MRSV-1. The primer SEQ 
5 ID NO: 116 is specific for the sequence FBdl3 (also 
called B13) and is located in the conserved env region 
among the oncoretroviruses . 

An amplfication product of 1932 bp was obtained 
and cloned in the following manner: 

10 the amplified DNA was inserted into a plasmid with the 
aid of the TA Cloning kit®. The various steps were 
carried out in accordance with the instructions for the 
TA Cloning kit® (Invitrogen) . At the end of the 
procedure, the white colonies of recombinant bacteria 

15 (white) were subcultured so as to be cultured and allow 
the extraction of the plasmids incorporated according 
to the so-called "miniprep" procedure. The plasmid 
preparation of each recombinant colony was cut with an 
appropriate restriction enzyme and analyzed on agarose 

20 gel. The plasmids possessing an insert detected under 
UV light after staining the gel with ethidium bromide 
were selected for the sequencing of the insert after 
hybridization with a primer complementary to the SP6 
promoter present on the cloning plasmid of the TA 

2 5 cloning kit®. The reaction prior to the sequencing was 

then carried out according to the method recommended 
for using the sequencing kit "PRISM™ Ready Reaction 
AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied 
Biosystems, ref . 402119) and the automated sequencing 

3 0 was carried out on the Applied Biosystems 3 73 A and 3 77 

apparatus, according to the manufacturer's 

instructions . 

The C15 clone obtained contains a region 
corresponding to the region of the MSRV-1 envelope of 
35 1481 bp. 

The env region of the C15 clone is represented 
by its nucleotide sequence (SEQ ID NO: 117) in 
Figure 5. The three potential reading frames of this 
clone are presented by their amineo [sic] acid sequence 
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under the nucleotide sequence. The reading frame 
corresponding to an MSRV-1 structural env protein is 
identified by SEQ ID NO: 118. 

From the defined sequences obtained from clones 
cl6 and CIS , it was possible to produce a plasmid 
construct encoding a complete envelope followed by the 
3' LTR, as presented in Figure 13 with the 
corresponding reading frame. 

EXAMPLE 3 : PREPARATION OF A 5M6 CLONE CONTAINING THE 
SEQUENCES OF THE 3' TERMINAL REGION OF THE ENVELOPE, 
FOLLOWED BY THE MSRV-1 PROVIRAL TYPE U3 , R AND U5 
SEQUENCES 

A monodirectional PCR was carried out on the 
DNA extracted from immortalized B lymphocytes in 
culture from an MS patient. The PCR was carried out 
with Expand™ Long Template PCR System (Boehringer) 
under the following conditions: 94°C 3 min then 93 °C 
1 min, 60°C 1 min, 68°C 3 min over 10 cycles, then 93°C 
1 min, 60°C 1 min with 15 sec of extension at each 
cycle, 68°C 3 min over 3 5 cycles and 68°C for 7 min and 
with a final reaction volume of 50 |il . 

The primer used for the PCR identified by SEQ 
ID NO: 119 is 5' TCA AAA TCG AAG AGC TTT AGA CTT GCT 
AAC CG 3 ' ; 

The primers [sic] SEQ ID NO: 119 is specific for the 
env region of the C15 clone. 

An amplification product of 1673 bp was 
obtained and cloned in the following manner: 
the amplified DNA was inserted into a plasmid with the 
aid of the TA Cloning kit®. The various steps were 
carried out in accordance with the instructions for the 
TA Cloning kit® (Invitrogen) . At the end of the 
procedure, the white colonies of recombinant bacteria 
(white) were subcultured so as to be cultured and allow 
the extraction of the plasmids incorporated according 
to the so-called "miniprep* procedure. The plasmid 
preparation of each recombinant colony was cut with an 
appropriate restriction enzyme and analyzed on agarose 
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gel. The plasmids possessing an insert detected under 
UV light after staining the gel with ethidium bromide 
were selected for the sequencing of the insert after 
hybridization with a primer complementary to the T7 
5 promoter present on the cloning plasmid of the TA 
cloning kit®. The reaction prior to the sequencing was 
then carried out according to the method recommended 
for using the sequencing kit tt PRISM™ Ready Reaction 
AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied 

10 Biosystems, ref . 402119) and the automated sequencing 
was carried out on the Applied Biosystems 373 A and 377 
apparatus, according to the manufacturer's 

instructions . 

The 5M6 clone obtained contains a region 

15 corresponding to the 3' region of the MSRV-1 envelope 
of 492 bp followed by the regions U3 , R and U5 (837 bp) 
of MSRV1. 

The 5M6 clone is represented by its nucleotide 
sequence (SEQ ID NO: 12 0) in Figure 5. The three 

2 0 potential reading frames of this clone are presented by 

their amineo [sic] acid sequence under the nucleotide 
sequence. The reading frame corresponding to the 
C-terminal end of the MSRV-1 env protein is identified 
by SEQ ID NO: 121. 

25 

EXAMPLE 4 : PREPARATION OF THE LB16 CLONE CONTAINING THE 
REGION ENCODING THE MSRV-1 RETROVIRUS INTEGRASE 

An RT-PCR was carried out on the total RNA 
treated with DNAsel and extracted from a choroid plexus 
30 obtained from an MS patient. The synthesis of cDNA was 
carried out with an oligo dT primer and the reverse 
transcriptase "Expand™ RT" from Boehringer according to 
the conditions recommended by the company. A "no RT" 
control was carried out in parallel on the same 

3 5 material. A PCR was carried out with Taq polymerase 

(Perkin Elmer) under the following conditions: 95°C 
5 min, then 95°C 1 min, 55°C 1 min, 72°C 2 min over 35 
cycles and 72°C for 8 min and with a final reaction 
volume of 50 |il . 



Primers used for the PCR: 

- 5' primer, identified by SEQ ID NO: 122 
5 ' GGC ATT GAT AGC ACC CAT CAG 3 ' ; 

- 3' primer, identified by SEQ ID NO: 123 
5 ' CAT GTC ACC AGG GTG GAA TAG 3 ' 

The primer SEQ ID NO: 122 is specific for the 
pol region of MSRV-1 and more precisely similar to the 
integrase region described above. The primer SEQ ID NO 
123 was defined on sequences of the clones obtained 
during preliminary tests. 

An amplification product of about 7 60 bp was 
obtained only in the test with RT and was cloned in the 
following manner: 

the amplified DNA was inserted into a plasmid with the 
aid of the TA Cloning kit®. The various steps were 
carried out in accordance with the instructions for the 
TA Cloning kit® (Invitrogen) . At the end of the 
procedure, the white colonies of recombinant bacteria 
(white) were subcultured so as to be cultured and allow 
the extraction of the plasmids incorporated according 
to the so-called "miniprep" procedure. The plasmid 
preparation of each recombinant colony was cut with an 
appropriate restriction enzyme and analyzed on agarose 
gel. The plasmids possessing an insert detected under 
UV light after staining the gel with ethidium bromide 
were selected for the sequencing of the insert after 
hybridization with a primer complementary to the T7 
promoter present on the cloning plasmid of the TA 
cloning kit®. The reaction prior to the sequencing was 
then carried out according to the method recommended 
for using the sequencing kit * PRISM™ Ready Reaction 
AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied 
Biosys terns, ref. 402119) and the automated sequencing 
was carried out on the Applied Biosystems 373 A and 377 
apparatus, according to the manufacturer's 

instructions . 

The LB16 clone obtained contains the sequences 
corresponding to integrase. The nucleotide sequence of 




- 26 - 



this clone was identified by SEQ ID NO: 124 in 
Figure 11, three reading frames are determined. 

EXAMPLE 5 : PREPARATION OF A CLONE 2, CL2 , CONTAINING IN 
5 3' A PORTION HOMOLOGOUS TO THE POL GENE, CORRESPONDING 
TO THE PROTEASE GENE, AND TO THE GAG GENE (GM3) 
CORRESPONDING TO THE NUCLEOCAPSID, AND A NEW 5 ' CODING 
REGION, CORRESPONDING TO THE GAG GENE MORE SPECIFICALLY 
THE TEMPLATE AND THE CAPSID of MSRV-1. 

10 A PCR amplification was carried out on the 

total RNA extracted from 100 (il of plasma from a 
patient suffering from MS. A water control, treated 
under the same conditions, was used as negative 
control. The synthesis of cDNA was carried out with 

15 3 00 pmol of a random primer (GIBCO-BRL, France) and the 
reverse transcriptase "Expand RT" (BOEHRINGER MANNHEIM, 
France) according to the conditions recommended by the 
company. An amplification by PCR ( "polymerase chain 
reaction") was carried out with the enzyme Taqr 

2 0 polymerase (Perkin Elmer, France) using 10 |il of cDNA 

under the following conditions: 94°C 2 min, 55°C 1 min, 
72°C 2 min then 94°C 1 min, 55°C 1 min, 72°C 2 min over 
30 cycles and 72°C for 7 min with a final reaction, 
volume of 50 (il . 
25 Primers used for the PCR amplification: 

- 5' primer, identified by SEQ ID NO: 126 
5 ' CGG ACA TCC AAA GTG ATG GGA AAC G 3 ' ; 

- 3' primer, identified by SEQ ID NO: 127 
5 ' GGA CAG GAA AGT AAG ACT GAG AAG GC 3 ' 

3 0 A second amplification by so-called 

"seminested" PCR was carried out with a 5' primer 
situated inside the region already amplified. This 
second PCR was carried out under the same experimental 
conditions as those used during the first PCR, using 
3 5 10 jil of the amplification product derived from the 
first PCR. 

Primers used for the amplification by seminested PCR: 

- 5' primer, identified by SEQ ID NO: 128 
5 ' CCT AGA ACG TAT TCT GGA GAA TTG GG 3 ' ; 
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- 3' primer, identified by SEQ ID NO: 129 
5 ' TGG CTC TCA ATG GTC AAA CAT ACC CG 3 ' 

The primers SEQ ID NO: [lacuna] and SEQ ID NO: 
[lacuna] are specific for the pol region, clone G+E+A, 
more specifically the E region: nucleotide position No. 
423 to No. 448. The primers used in the 5' region were 
defined on sequences of clones obtained during 
preliminary tests . 

An amplification product of 1511 bp was 
obtained from the RNA extracted from the plasma of an 
MS patient. The corresponding fragment was not observed 
for the water control. This amplification product was 
cloned in the following manner. 

The amplified DNA was inserted into a plasmid 
with the aid of the TA Cloning kit™. The 2 jil of DNA 
solution were mixed with 5 |al of sterile distilled 
water, 1 jal of a 10 times concentrated ligation buffer 
"10X LIGATION BUFFER" , 2 Jil of "pCR™ VECTOR" 
(25 ng/ml)and 1 |il of "T4 DNA LIGASE" . This mixture was 
incubated overnight at 14°C. The following steps were 
carried out in accordance with the instructions of the 
TA Cloning kit® ( Invitrogen) . The mixture was plated 
after transformation of the ligation into E. coli 
INVaF ' bacteria. At the end of the procedure, the white 
colonies of recombinant bacteria were subcultured so as 
to be cultured and allow the extraction of the plasmids 
incorporated according to the so-called "DNA 
minipreparation" procedure (17) . The plasmid 
preparation of each recombinant colony was cut with the 
restriction enzyme EcoRI and analyzed on agarose gel. 
The plasmids possessing an insert detected under UV 
light after staining the gel with ethidium bromide were 
selected for the sequencing of the insert after 
hybridization with a primer complementary to the T7 
promoter present on the cloning plasmid of the TA 
cloning kit®. The reaction prior to the sequencing was 
then carried out according to the method recommended 
for using the sequencing kit "PRISM™ Ready Reaction 
Amplitaq® FS, DyeDeoxy™ Terminator" (Applied 
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Biosystems, ref. 402119) and the automated sequencing 
was carried out on the Applied Biosystems 373 A and 377 
apparatus, according to the manufacturer's 

instructions . 

The clone obtained, called CL2 , contains a C- 
terminal region similar to the 5' terminal region of 
the clones G+E+A of MSRV-1, which makes it possible to 
define the C-terminal region of the gag gene and a new 
region corresponding to the N-terminal region of the 
MSRV-1 gag gene. 

CL2 makes it possible to define a region of 
1511 bp having an open reading frame in the N-terminal 
region of 1077 bp encoding 3 59 amino acids and a non- 
open reading frame of 454 bp corresponding to the 
C-terminal region of the MSRV-1 gag gene. 

The nucleotide sequence of CL2 is identified by 
SEQ ID NO: 13 0. It is represented in Figure 6 with the 
potential reading frames in amineo [sic] acid. 

The 1077 bp fragment of CL2 encoding 3 59 amino 
acids was amplified by PCR with the Pwo enzyme (5U/|il) 
(Boehringer Mannheim, France) using 1 jal of the DNA 
minipreparation of clone 2 under the following 
conditions: 95°C 1 min, 60°C 1 min, 72°C 2 min over 
25 cycles and with a final reaction volume of 50 (il 
with the aid of the primers: 

- 5' primer (BairtHI) , identified by SEQ ID NO: 132 

5' TGC TGG AAT TCG GGA TCC TAG AAC GTA TTC 3' (30 mer) , 
and 

- 3' primer (Hindlll) , identified by SEQ ID NO: 133 

5 AGT TCT GCT CCG AAG CTT AGG CAG ACT TTT 3 ' (30 mer ) 
corresponding, respectively, to the nucleotide sequence 
of clone 2 at position -9 to 21 and 1066 to 1095. 

The fragment obtained by PCR was linearized 
with BarnHI and Hindi 1 1 and subcloned into the 
expression vectors pET28C and pET21C (NOVAGEN) 
linearized with BamHl and Hindlll. The sequencing of 
the DNA of the 1077 bp fragment of clone 2 in the two 
expression vectors was carried out according to the 
method recommended for the use of the sequencing kit 
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"PRISM™ Ready Reaction Amplitaq® FS, DyeDeoxy™ 
Terminator" (Applied Biosystems, ref. 402119) and the 
automated sequencing was carried out on the Applied 
Biosystems 373 A and 377 apparatus, according to the 
manufacturer' s instructions . 

The expression of the nucleotide sequence of 
the 1077 bp fragment of clone 2 by the expression 
vectors pET28C and pET21C are identified by SEQ ID NO: 
135 and SEQ ID NO: 137, respectively. 

EXAMPLE 6 : EXPRESSION OF CLONE 2 IN ESCHERICHIA COLI 

The constructs pET28c-clone 2 (1077 bp) and 
pET21C-clone 2 (1077 bp) synthesize, in the bacterial 
strain BL21 (DE3 ) , a protein fused at the N- and C- 
terminus for the vector pET28C and the C-terminus for 
the vector pET21C with 6 Histidines, having an apparent 
molecular mass of about 45 kDa, identified by SDS-PAGE 
polyacrylamide gel electrophoresis (SDS = Sodium 
Dodecyl Sulfate) (Laemmli, 1970 (1)). The reactivity of 
the protein was demonstrated towards an anti-Histidine 
monoclonal antibody (DIANOVA) by the Western-blot 
technique (Towbin et al . , 1979 (2)). 

The recombinant proteins pET2 8c-clone 2 
(1077 bp) and pET21C-clone 2 (1077 bp) were visualized 
by SDS-PAGE in the insoluble fraction after enzymatic 
digestion of the bacterial extracts with 50 |il of 
lysozyme (10 mg/ml) and ultrasound lysis. 

The antigenic properties of the recombinant 
antigens pET28C-clone 2 (1077 bp) and pET21C-clone 2 
(1077 bp) were tested by Westen blotting () [sic] after 
solubilization of the bacterial pellet with 2% SDS and 
50 mM p-mercaptoethanol . After incubation with sera 
from patients suffering from multiple sclerosis, the 
sera from neurological controls and the sera from 
controls at the Blood Transfusion Center (CTS) , the 
immunocomplexes were detected with the aid of an 
alkaline phosphatase-coupled goat serum anti-human IgG 
and ant i -human IgM. 

The results are presented in the table below. 
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TABLE 

Reactivity of sera affected by multiple sclerosis and 
controls with the MSRV-1 recombinant protein gag 
clone 2 (1077 bp) = pET21C-clone 2 (1077 bp) and 
pET28C-clone 2 (1077 bp) a 



DISEASE 



NUMBER OF NUMBER OF POSITIVE 

IND I VI DUAL S I ND I VI DUAL S 
TESTED 



MS 

NEUROLOGICAL 
CONTROLS 
HEALTHY 
CONTROLS (CTS) 



15 



22 



2<+ + +), 2(++), (2(+) 
K+ + +) 
M + /-) 



(a) The strips containing 1 . 5 |ag of recombinant 
antigen pET-gag clone 2 (1077 bp) exhibit reactivity 
10 against sera diluted 1/100. The Western-Blot 
interpretation is based on the presence or absence of a 
specific pET-gag clone 2 (1077 bp) band on the strips. 
Positive and negative controls are included in each 
experiment . 

15 These results show that, under the technical 

conditions used, about 40% of the human sera affected 
by multiple sclerosis which were tested react with the 
recombinant proteins pET28C-clone 2 (1077 bp) and 
pET21C-clone 2 (1077 bp) . Reactivity was observed on a 

20 neurological control and it is of interest to note that 
the RNAs extracted from this serum, after the reverse 
transcriptase step, are also amplified by PCR in the 
pol region. This suggests that people who have not 
declared MS may also harbor and express this virus . On 

2 5 the other hand, an apparently healthy control (CTS 
donor) possesses anti-gag (clone 2, 1077 bp) 
antibodies. This is compatible with an immunity 
acquired against MSRV-1 independently of a declared 
associated autoimmune disease. 
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EXAMPLE 7 : PREPARATION OF AN LB13 CLONE CONTAINING IN 
3 ' A PORTION HOMOLOGOUS TO CLONE 2 CORRESPONDING TO THE 
GAG GENE AND IN 5 ' A PORTION HOMOLOGOUS TO THE 5M6 
CLONE CORRESPONDING TO THE U5 LTR REGION 

An RT-PCR ("reverse transcriptase-polymerase 
chain reaction") was carried out using total RNA 
extracted from virions, obtained from supernatants of B 
lymphocyte cells of patients suffering from multiple 
sclerosis, concentrated by ultracentrif ugations . The 
synthesis of cDNA was carried out with a specific 
primer SEQ No. XXX and the reverse transcriptase 
"Expand™ RT" from BOEHRINGER MANNHEIM according to the 
conditions recommended by the company. 

Primer used for the synthesis of the cDNA, identified 
by SEQ ID NO: 13 8 : 

5 ' CTT GGA GGG TGC ATA ACC AGG GAA T 3 ' 

A PCR amplif ication was carried out with Taq 
polymerase (Perkin Elmer, France) under the following 
conditions: 94°C 1 min, 55°C 1 min, 72°C 2 min over 35 
cycles at 72°C for 7 min and with a final reaction- 
volume of 100 ]xl . 

Primers used for the PCR amplification: 

- 5' primer, identified by SEQ ID NO: 139 
5 ' TGT CCG CTG TGC TCC TGA TC 3 ' 

- 3' primer, identified by SEQ ID NO: 138 
5 ' CTT GGA GGG TGC ATA ACC AGG GAA T 3 ' 

A second so-called "seminested" PCR 
amplification was carried out with a 3 ' primer situated 
inside the region already amplified. This second 
amplification was carried out under the same 
experimental conditions as those used during the first 
amplification, using 10 jil of the amplification product 
derived from the first PCR. 

Primers used for the "seminested" PCR amplification: 

- 5' primer, identified by SEQ ID NO: 139 
5 ' TGT CCG CTG TGC TCC TGA TC 3 ' 

- 3' primer, identified by SEQ ID NO: 140 
5 ' CTA TGT CCT TTT GGA CTG TTT GGG T 3 ' 
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The primers SEQ ID NO: 13 8 and SEQ ID NO: 140 
are specific for the gag region, clone 2 nucleotide 
position No. 373-397 and No. 433-456. The primers used 
in the 5' region were defined on sequences of the 
5 clones obtained during preliminary tests. 

An amplification product of 7 64 bp was obtained 
and cloned in the following manner: 

The amplified DNA was inserted into a plasmid 
with the aid of the TA Cloning kit™. The 2 |il of DNA 

10 solution were mixed with 5 jal of sterile distilled 
water, 1 |il of a 10 times concentrated ligation buffer 
U 10X LIGATION BUFFER", 2 |il of u pCR™ VECTOR" 
(25 ng/ml)and 1 |il of tt T4 DNA LIGASE" . This mixture was 
incubated overnight at 14°C. The following steps were 

15 carried out in accordance with the instructions of the 
TA Cloning kit® ( Invitrogen) . The mixture was plated 
after transformation of the ligation into E. coli 
INVaF ' bacteria. At the end of the procedure, the white 
colonies of recombinant bacteria were subcultured so as 

20 to be cultured and allow the extraction of the plasmids 
incorporated according to the so-called "DNA 
minipreparation" procedure (17). The plasmid 
preparation of each recombinant colony was cut with the 
restriction enzyme EcoRl and analyzed on agarose gel. 

25 The plasmids possessing an insert detected under UV 
light after staining the gel with ethidium bromide were 
selected for the sequencing of the insert after 
hybridization with a primer complementary to the T7 
promoter* present on the cloning plasmid of the TA 

3 0 cloning kit . The reaction prior to the sequencing was 
then carried out according to the method recommended 
for using the sequencing kit "PRISM™ Ready Reaction 
Amplitaq® FS, DyeDeoxy™ Terminator" (Applied 
Biosystems, ref . 402119) and the automated sequencing 

3 5 was carried out on the Applied Biosystems 373 A and 377 
apparatus, according to the manufacturer's 

instructions . 

The LB13 clone obtained contains an N-terminal 
region of MSRV-1 gag gene homologous to clone 2 and an 
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LTR corresponding to a portion of the U5 region. 
Between the U5. region and gag, a binding site for the 
transfer RNAs, the PBS "primer binding site", was 
identified. 

5 The nucleotide sequence of the 7 64 bp fragment 

of the LB13 clone in the plasmid u pCR™ vector" is 
represented in the identifier SEQ ID NO: 141. 

The binding site for the transfer RNAs, having 
a sequence of PBS tryptophan type, was identified at 
10 nucleotide position No. 342-359 of the LB13 clone. 

As this same PBS was found in the endogenous 
copies homologous to MSRV1, the endogenous family thus 
defined is henceforth called HERV W, according to the 
nomenclature proposed for the endogenous retrovirus 
15 families (W= tryptophan) . 

A short ORF of about 65 amino acids was found 
in the U5 region of the 5' LTR of the LB13 clone. 
Sequence of the ORF: 

PMASNRAITLTAWSKIPFLGIRETKNPRSENTRLATMLEAAHHHFGSSPPLSWEL 
2 0 WEQGPQVTIW. 

The corresponding nucleotide sequence starting 
at an ATG codon is capable of being expressed in a 
subgenomic DNA from a proviral LTR (U3RU5) . 

Another clone, called LA15, was obtained on the 

2 5 total RNA extracted from virions concentrated by 

ultracentrifugation from a culture supernatant of 
synoviocytes obtained from a patient suffering from 
rheumatoid arthritis. The strategy for amplifying and 
cloning the LA15 clone is exactly the same which was 

3 0 used for the LB13 clone. 

The nucleotide sequence of the LA15 clone, 
which is represented in the identifier SEQ ID NO: 142, 
is very similar to the LD13 clone. This suggests that 
the MSVR-1 retrovirus detected in multiple sclerosis 
35 has sequences which are similar to those found in 
rheumatoid arthritis. 
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EXAMPLE 8 : RECONSTRUCTION OF AN RU5-GAG REGION FROM THE 
CLONES LB15, LB13 , CL2 AND CL17 

The clones CL2 and LB13 have already been 
described in the preceding examples. The LB15 clone was 
5 obtained using the R sequence of the LTR of the cl6 
clone in order to define a primer in 5' and the anti- 
sense primers used are the same as for the LB13 clone. 
The CL17 clone was obtained by nested RT-PCR using the 
following primers : 

10 

5 ' - TC ATGC AAC TGC AC TC TTC TGGTCCG - 3 ' ( sense) 

5 ' -TCTTGCACTAACCTCCACTGTCCGTTGG-3 ' (antisense) 



5 ' -ATCCCCCAGTAACAATTTGGTGACCACG-3 ' (sense) 
15 5 ' -TCGGGTCTAAGAGGGTACTTCCTTTGGTAGG-3 ' (antisense) 

The LB15 clone was obtained from virions 
obtained by culturing MS cells. The LB17 clone was 
obtained from culturing plasma from an MS patient. 
20 These overlapping clones made it possible to 

reconstruct an RU5-gag sequence with a potential ORF in 
the gag gene, as presented in Figure 14. 



EXAMPLE 9 : PREPARATION OF A CLONE 87-23 

2 5 The region corresponding to integrase was 

amplified and cloned from MS plasma using a seminested 
RT PCR with the following primers situated in the pol 
and env regions of MSRV1 . 
In the pol region: 

3 0 5 ' -TTACGCAGGTCTCAGGGATGAGCTT- 3 ' ( sense -primary PCR) 

5 ' -CGGCAGTAGCAGTCTTAGTATCTGAAGCAGTTA-3 ' ( sense- 

secondary PCR) 



In the env region, 
3 5 5 ' -GGTACGGAGGGTTTCATGTAGTTTTGAG- 3 ' (anti-sense primary 
and secondary PCR) 

The amplified clone contains 774 bp in the 
pol/RT region, all the integrase region (1197 bp) and 
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the start of the env region (480 bp) . The nucleotide 
sequence corresponding to- the integrase region and the 
translation to amino acids of the potential ORF are 
presented in Figure 15. 

5 

EXAMPLE 10 : CONFIRMATION OF THE PRESENCE OF RNA 
CONTAINING ENV SEQUENCES RELATED TO ERV9 IN THE 
RETROVIRAL PARTICLES ASSOCIATED WITH THE MSRV1 GENOME: 

Sequences related to ERV9 have been found in a 

10 minor proportion in the virion preparations obtained 
from MS compared with the MSRV1 sequences. The 
existence of phenomena of co-encapsidation of 
phylogenetically related endogenous sequences into 
retroviral particles produced by a replicative strain 

15 has been described. Surprisingly, an RNA region 
comprising an ORF starting in the 3 ' portion of env and 
continuing potentially into the 3 ' LTR has been found 
in various MS samples. In order to specify the 
existence of an ORF, transcription-translation tests 

2 0 were carried out and made it possible to show the 

reality of an env ORF containing the entire 
transmembrane (TM) portion and ending at the start of 
the putative LTR. However, an additional frame (ORFX) 
follows and continues in the 3' LTR. The two products 
25 of expression were visualized and their respective ORFs 
were subcloned. Figure 16 represents the nucleotide and 
peptide sequences of the B13 clone already described, 
specifying the ORFs in the truncated env region and in 
the putative LTR. The presence of such RNAs may be 

3 0 responsible for recombinations with the replicative 

strain and consequently generate strains having a 
modified pathogenicity. 
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CLAIMS 

1. Nucleic material, in isolated or purified 
state, comprising a nucleotide sequence chosen from the 

5 group which consists of (i) the sequences 

SEQ ID NO: 112, SEQ ID NO: 114, SEQ ID NO: 117, 

SEQ ID NO: 120, SEQ ID NO: 124, SEQ ID NO: 130, 

SEQ ID NO: 141 and SEQ ID NO: 142; (ii) the sequences 
complementary to sequences (i) ; and (iii) the sequences 
10 equivalent to sequences (i) or (ii) , in particular the 
sequences having, for every series of 100 contiguous 
monomers, at least 50%, and preferentially at least 70% 
homology with sequences (i) or (ii) respectively. 

2. Nucleic material, in isolated or purified 
15 state, encoding a polypeptide having, for every 

contiguous series of at least 30 amino acids, at least 
50%, and preferably at least 70% homology with a 
peptide sequence chosen from the group which consists 
of SEQ ID NO: 113, SEQ ID NO: 115, SEQ ID NO: 118, 
2 0 SEQ ID NO: 121, SEQ ID NO: 13 5 and SEQ ID NO : 13 7. 

3. Retroviral nucleic material, whose pol gene 
comprises a nucleotide sequence identical or equivalent 
to a sequence chosen from the group which consists of 
SEQ ID NO: 112, SEQ ID NO: 124 and their complementary 

2 5 sequences. 

4. Retroviral nucleic material, in which the 5' 
end of the pol gene starts at nucleotide 1419 of SEQ ID 
NO: 130. 

5. Retroviral nucleic material, in which the pol 

3 0 gene encodes a polypeptide having, for every contiguous 

series of at least 30 amino acids, at least 50%, and 
preferably at least 70% homology with the peptide 
sequence SEQ ID NO: 113. 

6. Retroviral nucleic material, in which the 3' 
3 5 end of the gag gene ends at nucleotide 1418 of SEQ ID 

NO: 13 0. 

7. Retroviral nucleic material, in which the env 
gene comprises a nucleotide sequence identical or 

REPLACEMENT SHEET (RULE 26) 
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equivalent to a sequence chosen from the group which 
consists of SEQ ID NO: 117, and its complementary 
sequences . 

8. Retroviral nucleic material, in which the env 
5 gene comprises a nucleotide sequence which starts at 

nucleotide 1 of SEQ ID NO: 117 and ends at nucleotide 
at nucleotide [sic] 233 of SEQ ID NO: 114. 

9. Retroviral nucleic material, in which the env 
gene encodes a polypeptide having, for every contiguous 

10 series of at least 30 amino acids, at least 50%, and 
preferably at least 70% homology with the sequence 
SEQ ID NO: 118. 

10. Retroviral nucleic material in which the U3R 
region of the 3' LTR comprises a nucleotide sequence 

15 which ends at nucleotide 617 of SEQ ID NO: 114. 

11. Retroviral nucleic material in which the RU5 
region of the 5' LTR comprises a nucleotide sequence 
which starts at nucleotide 755 of SEQ ID NO: 120 and 
ends at nucleotide 33 7 of SEQ ID NO: 141 or SEQ ID NO: 

20 142. 

12. Retroviral nucleic material comprising a 
sequence which starts at nucleotide 755 of SEQ ID NO: 
120 and which ends at nucleotide 617 of SEQ ID NO: 114. 

13 . Retroviral nucleic material according to any 
25 one of the preceding claims, characterized in that it 

is associated with at least one autoimmune disease such 
as multiple sclerosis or rheumatoid arthritis. 

14. Nucleotide fragment comprising a nucleotide 
sequence chosen from the group which consists of (i) 

3 0 the sequences SEQ ID NO: 112, SEQ ID NO: 114, 

SEQ ID NO: 117, SEQ ID NO: 120, SEQ ID NO: 124, 

SEQ ID NO: 13 0, SEQ ID NO: 141 and SEQ ID NO: 142; (ii) 
the sequences complementary to sequences (i); and (iii) 
the sequences equivalent to sequences (i) or (ii) , in 

35 particular the sequences having, for every series of 
100 contiguous monomers, at least 50%, and 
preferentially at least 70% homology with sequences (i) 
or (ii) respectively. 
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15. Nucleotide fragment according to Claim 14, 
consisting of a nucleotide sequence chosen from the 
group which consists of (i) the sequences 
SEQ ID NO: 112, SEQ ID NO: 114, SEQ ID NO: 117, 
SEQ ID NO: 120, SEQ ID NO: 124, SEQ ID NO: 130, 
SEQ ID NO: 141 and SEQ ID NO: 142; (ii) the sequences 
complementary to sequences (i); and (iii) the sequences 
equivalent to sequences (i) or (ii), in particular the 
sequences having, for every series of 100 contiguous 
monomers, at least 50%, and preferentially at least 70% 
homology with sequences (i) or (ii) respectively. 

16 . Nucleotide fragment comprising a nucleotide 
sequence encoding a polypeptide having, for every 
contiguous series of at least 30 amino acids, at least 
50%, and preferably at least 70% homology with a 
peptide sequence chosen from the group which consists 
of SEQ ID NO: 113, SEQ ID NO: 115, SEQ ID NO: 118, 
SEQ ID NO: 121, SEQ ID NO: 135 and SEQ ID NO: 137. 

17. Nucleotide fragment according to claim 16, 
consisting of a nucleotide sequence encoding a 
polypeptide having, for every contiguous series of at 
least 3 0 amino acids, at least 50%, and preferably at 
least 70% homology with a peptide sequence chosen from 
the group which consists of SEQ ID NO: 113, SEQ ID NO: 
115, SEQ ID NO: 118, SEQ ID NO: 121, SEQ ID NO: 135 and 
SEQ ID NO: 137 . 

18. Nucleic probe for the detection of a retrovirus 
associated with multiple sclerosis and/or rheumatoid 
arthritis, characterized in that it is capable of 
hybridizing specifically with any fragment according to 
any one of claims 14 to 17, belonging to the genome of 
said retrovirus . 

19. Probe according to claim 18, characterized in 
that it possesses from 10 to 100 nucleotides, 
preferably from 10 to 3 0 nucleotides. 

20. Primer for the amplification, by 
polymerization, of an RNA or of a DNA of a retrovirus 
associated with multiple sclerosis and/or rheumatoid 



REPLACEMENT SHEET (RULE 26) 



- 40 - 

arthritis, characterized in that it comprises a 
nucleotide sequence identical or equivalent to at least 
a portion of the nucleotide sequence of a fragment 
according to any one of claims 8 to 11, in particular a 
nucleotide sequence having, for every series of 10 
contiguous monomers, at least 50%, preferably at least 
70% homology with at least said portion of said 
fragment . 

21. Primer according to claim 20, characterized in 
that its nucleotide sequence is chosen from 
SEQ ID NO: 116, SEQ ID NO: 119, SEQ ID NO: 122, 
SEQ ID NO: 123, SEQ ID NO: 126, SEQ ID NO: 127, 
SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 132, and 
SEQ ID NO: 13 3 . 

22 . RNA or DNA, and in particular replication 
and/or expression vector, comprising a genomic fragment 
of the nucleic material according to any one of claims 
1 to 7 or a fragment according to any one of claims 14 
to 17 . 

23. Peptide encoded by any open reading frame 
belonging to a nucleotide fragment according to any one 
of claims 14 to 17, in particular a polypeptide, for 
example oligopeptide forming or comprising an antigenic 
determinant recognized by sera of patients infected 
with the MSRV-1 virus, and/ or in whom the MSRV-1 virus 
has been reactivated. 

24. Peptide according to claim 23 comprising a 
sequence identical, partially or completely, or 
equivalent to a sequence chosen from SEQ ID NO: 113, 
SEQ ID NO: 115, SEQ ID NO: 118, SEQ ID NO : 121, 
SEQ ID NO: 13 5 and SEQ ID NO: 13 7. 

25. Diagnostic, prophylactic or therapeutic 
composition, in particular for inhibiting the 
expression of at least one retrovirus associated with 
multiple sclerosis and/or rheumatoid arthritis, 
comprising a nucleotide fragment according to any one 
of claims 14 to 17. 
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26. Method for detecting a retrovirus associated 

with multiple sclerosis and/or rheumatoid arthritis, in 
a biological sample, characterized in that an RNA 
and/or a DNA assumed to belong to or obtained from said 
retrovirus, or their complementary RNA and/or DNA, is 
brought into contact with a composition comprising a 
nucleotide fragment according to any one of claims 14 
to 17. 
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PCT/FR98/01460 



(2)- INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
GACTCGCTGC AGATCGATTT TTTTTTTTTT TTTT 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
GCCATCAAGC CACCCAAGAA CTCTTAACTT 

(2) INFORMATION FOR SEQ ID NO : 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
CCAATAGCCA GACCATTATA TACACTAATT 

(2) INFORMATION FOR SEQ ID NO: 112: 
(i) SEQUENCE CHARACTERISTICS: 



35 



(A) LENGTH: 310 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO- 112- 

GCTTATACAA GGACCCCTAG TATCGGGTAA TCCCCTCTGG GAAACCAAGC CCCAGTACTC 
AGCACCAAAA ATAGAATAGG AAACCTCACA AOGACATACT TTCCTCCCCT CCACATGGCT 
ACCCACICAG GAAOCAAAAA TACTTTCACC TGCAGCTAAC CAACACAAAT TACTTAAAAC 
CCTTCACCAA ACCTTCCACT TAGGCATTCA TAGCACCCAT CAGATCGCCA AATTATTATT 
TACTGGACCA GGCCTTTTCA AAACTATCAA CAACATAGTC AGGGGCTGTG AAGTGTGCCA 
AAGAAATAAT 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113- 
Leu lie Clu Cly Pro Leu Val Trp Cly A.n Pro Leu Trp Glu Thr Ly. 

Pro Cln Tyr Ser Ala Gly Ly. He Glu Xaa Glu Thr Ser Gin Cly His 

20 25 30 

Thr Phe Leu Pro Ser Arg Trp Leu Ala Thr Glu Glu Gly Ly. lie Leu 

35 40 45 

Ser Pro Ala Ala A.n Cln cin Lye Leu Leu Lys Thr Leu Hie Cln Thr 

50 55 60 

Phe Hi. Leu Gly He Asp Ser Thr Hi. Cln Met Ala Ly. Leu Leu Phe 

" 70 " 80 

Thr Cly Pro oly Le U Phe Ly. Thr He Ly. Ly. II. VmX * ro Gly Cy , 

85 90 9S 

Clu Val Cy. Cln Arg A.n Aon 

100 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 5 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 



60 
120 
180 
240 
300 
310 
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CCCTCTATCT TTAACCTCCT TCTTAACTTT GTCTCTTCCA GAATCAAAAC TCTAAAACTA 
CAAATTGTTC TTCAAATCGA CCACCAGATG GAGTCCATGA CTAAGATCCA CCGTCGACCC 
CTCGACCCGC CTGCTAGCCC ATGCTCCGAT GTTAATGACA TTGAAGGCAC CCCTCCCGAC 180 
GAAATCTCAA CTGCACAACC CCTACTATGC CCCAATTCAC CGGGAAGCAG TTACACCCCT 240 
CATCAGCCAA CCTCCCCAAC AGCACTTGGG TTTTCCTCTT GAGAGGGGGG ACTGACACAC 
AGO ACT AG CT CGATTTCCTA GGCCAACGAA GAATCCCTAA CCCTAGCTCG CAAGCTGACT 
GCATCCACCT CTAAACATGG GCCTTGCAAC TTAGCTCACA CCCGACCAAT CACAGACCTC 420 
ACTAAAATCC TAATTAGGCA AAAATAGGAC GTAAAGAAAT AGCCAATCAT CTATTGCCTC 480 
ACACCACAGC GCGAGGGACA AGGATCGGGA TATAAACCCA GGCATTCGAG CCGGCAACGG S40 
CAACCCCCTT TGGGTCCCCT CCCTTTCTAT GCGCGCTCTG TTTTCACTCT ATTTCACTCT 



60 
120 



300 
360 



600 



10 



ATTAAATCTT GCAACTGAAA AAAAAAAAAA AAAAA $ 35 

(2) INFORMATION FOR SEQ ID NO: 115 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 115: 
Pro cys lie Pho Aan Lou Lou Val Lye Phe Val ser Ser Arg He Lys 

1 5 10 is 

Thr val Lye Leu Gin tie Val Leu Gin Met Clu Hie Gin Met Clu Ser 

20 25 30 

Met Thr Lye tie Hie Arg Gly Pro Leu A»p Arg Pro Ala Ser Pro Cye 

35 40 45 

Ser Aep Val Aan Aep Zle Glu Gly Thr Pro Pro Glu Glu Zle Ser Thr 

90 55 60 

Ala Gin Pro Leu Leu Cye Pro Aen Ser Ala Gly Ser Ser 

65 70 75 
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(2) INFORMATION FOR SEQ ID NO : 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 116: 
TGGGGTTCCA TTTGTAAGAC CATCTGTAGC TT 3 2 



(2) INFORMATION FOR SEQ ID NO : 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1481 base pairs 

(B) TYPE: nucleotide 

15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



117: 




CGCTCTCACT 


60 


TCTATGAACA 


120 


CCCAAACTCC 


180 


TcrrrccATc 


240 


TCGAGGACTT 


300 


GGGTGGAATT 


360 


CCGGGGACAT 


420 


CCTCCGTACC 


480 


CCTCTCACCC 


540 


CATTTCAATC 


600 


CCTTTTAGTA 


660 


TOTAAAATTT 


720 


TCCCACACGA 


780 


TCATTGTTTG 


840 


GACCATCTAC* 


900 


ACTACCCATT 


960 


TGGCACTATC 


1020 



20 
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ACAACCTCTA CTCACTTCTA CTACAAACTA TCTCAAGAAA TAAATCCTCA CATGCAACAG 1080 

CTCACTCACT CCCTCCTCAC CTTCCAACAT CAACTTAACT CCCTAGCACC ACTACTCCTT 1140 

CAAAATCCAA G AC CTTTAC A CTTGCTAACC GCCAAAAGAG GGGGAACCTG TTTATTTTTA 1200 

CCACAAGAAC CCTGTTATTA TGTTAATCAA TCCAGAATTG TCACTGAGAA AGTTAAAGAA 1260 

ATTCCAGATC GAATACAATC TAGACCAGAG CAGCTTCAAA ACACCGAACG CTCGGGCCTC 1320 

CTCAGCCAAT GGATGCCCTG GGTTCTCCCC TTCTTAGGAC CTCTAGCAGC TCTAATATTG 1380 

TTACTCCTCT TTGGACCCTG TATCTTTAAC CTCCTTGTTA AGTTTGTCTC TTCCAGAATT 1440 

GAAGCTGTAA AGCTACACAT GCTCTTACAA ATCGAACCCC A 14ai 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 493 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

10 . (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

Mat Ala Leu Pro Tyr His Thr Phe Leu Phe Thr Val Leu Leu Pro Pro 

1 5 10 is 

Phe Ala Leu Thr Ala Pro Pro Pro Cys Cys Cys Thr Thr Ser Ser Ser 

20 25 30 

Pro Tyr Gin Clu Phe Leu Xaa Arg Thr Arg Leu Pro GXy Aan He Aap 

« 40 45 

Ala Pro Ser Tyr Arg Ser Leu Ser Lye Gly Asn ser Thr Phe Thr Ala 

50 55 60 

His Thr Hie Met Pro Arg Aen Cye Tyr Asn Ser Ala Thr Leu Cye Met 
65 70 75 80 

Hie Ala Aen Thr Hie Tyr Trp Thr Gly Lya Met He Aan Pro Ser Oym 

85 90 95 

Pro Gly Gly Leu Gly Ala Thr Vei Cye Trp Thr Tyr Phe Thr Hie Thr 

100 105 no 

Ser Met Ser Asp Gly Gly Gly lie Gin Gly Gin Ala Arg Glu Lye Gin 

115 120 125 

Val Lye Glu Ala He Ser Gin Leu Thr Arg Gly Hie Ser Thr Pro Ser 

130 135 140 

Pro Tyr Lys Gly Leu Val Leu Ser Lye Leu Hie Glu Thr Leu Arg Thr 
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145 "0 155 i 6 o 

His Thr Arg Leu Val ser Leu Phe Asn Thr Thr Leu Thr Arg Leu His 

165 170 175 

Glu Val Ser Ala Gin Asn Pro Thr Asn Cys Trp Met cys Leu Pro Leu 

180 185 190 

His Phe Arg Pro Tyr lie Ser lie Pro Val Pro Glu Gin Trp Asn Asn 

195 200 205 

Phe Ser Thr Glu He Asn Thr Thr Ser Val Leu Vai Cly Pro Leu Val 

210 215 220 

Ser Asn Leu Clu He Thr His Thr Ssr Asn Leu Thr Cys Val Lys Phe 
225 230 235 240 

Ser Asn Thr He Asp Thr Thr Ser Ser Gin Cys He Arg Trp Val Thr 

245 250 255 

Pro Pro Thr Arg He Val Cys Leu Pro Ser Gly He Phe Phe Val Cys 

260 265 270 

Gly Thr Ser Ala Tyr His Cys Leu Asn Cly Ser Ser Glu Ser Mot Cys 

275 280 285 

Phe Leu Ser Phe Leu Val Pro Pro Met Thr He Tyr Thr Clu Cln Asp 

290 295 300 

Leu Tyr Asn His Val Val Pro Lys Pro His Asn Lys Arg Val Pro He 
305 310 315 320 

Leu Pro Phe Val He Arg Ala Gly Val Leu Gly Arg Leu Cly Thr Gly 

325 330 335 

He Gly ser Zle Thr Thr Ser Thr Gin Phe Tyr Tyr Lys Leu ser Gin 

340 345 350 

Glu He Asn Gly Asp Met Glu cln Val Thr Asp Ser Leu Val Thr Leu 

355 360 365 

Gin Asp Gin Leu Asn Ser Leu Ala Ala Val Val Leu Cln Asn Arg Arg 

370 375 380 

Ala Leu Asp Leu Leu Thr Ala Lys Arg Gly Gly Thr Cys Leu Phe Leu 
385 390 395 400 

Gly Glu Glu Arg Cys Tyr Tyr Val Asn Gin Ser Arg He Vai Thr Glu 

405 410 415 

Lys Val Lys Clu He Arg Asp Arg He Cln Cys Arg Ala Glu Glu Leu 

420 425 430 

Gin Asn_ Thr Glu Arg Trp Cly Leu Leu Ser Gin Trp Met Pro Trp Val 
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435 440 
L«u Pro Phe Leu Cly Pro Leu Ala 

450 455 
Cly Pro Cys He Phe Aen Leu Leu 
465 4 70 
Clu Ala Val Lys Leu Gin Met Val 
485 



44S 

Ala Leu He Leu Leu Leu Leu Phe 
460 

Val Lye Phe Val Ser Ser Arg He 

47S 480 
Leu Gin Met Glu Pro 
490 



(2) INFORMATION FOR SEQ ID NO : 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
TCAAAATCGA AGAGCTTTAG ACTTGCTAAC CG 32 



(2) INFORMATION FOR SEQ ID NO : 12 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 2 9 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 0: 



TCAAAATCGA 


AGAGCTTTAG 


ACTTGCTAAC 


CGCCAAAAGA 


GGGGGAACCT 


GTTTATTTTT 


60 


ACCCGAAGAA 


TGCTGTTAGT 


ATCTTAATCA 


ATCTGGAATC 


ATTACTGAGA 


AAGTTAAAGA 


120 


AATTTGAGAT 


CGAATATAAT 


GTAGAGCAGA 


GGACCTTCAA 


AACACTGCAC 


CCTGGGGCCT 


180 


CCTCACCCAA 


TGGATGCCCT 


GCACTCTCCC 


CTTCTTAGCA 


CCTCTAGCAG 


CTATAATATT 


240 


TTTACTCCTC 


TTTGGACCCT 


GTATCTTCAA 


CTTCCTTGTT 


AAGTTTGTCT 


CTTCCAGAAT 


300 


TOAACCTCTA 


AAGCTACAAA 


TACTTCTTCA 


AATGCAACCC 


CAGATGCACT 


CCATCACTAA 


360 


AATCTACCCT 


GGACCCCTCG 


ACCGGCCTGC 


T AG ACT ATO C 


TCTOATGTTA 


ATGACATTGA 


420 


AGTCACCCCT 


CCCGAGGAAA 


TCTCAACTGC 


ACAACCCCTA 


CTACACTCCA 


ATTCAGTAGG 


480 


AAGCAGTTAG 


AGCAGTTCTC 


AGCCAACCTC 


CCCAACAGTA 


CTTGCGTTTT 


CCTGTTGAGA 


540 


GGGTCGACTC 


AGACACAGGA 


CTAGCTGGAT 


TTCCTAGGCT 


GACTAAGAAT 


CCCNAACCCT 


600 
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ANCTGGGAAG GTCACCCCAT CCATCTTTAA ACATCGCGCT TGCAACTTAG CTCACACCCC 660 
ACCAATCAGA GAGCTCACTA AAATGCTAAT CAGGCAAAAA CAGGAGGTAA AG CAA TAG CC 720 
AATCATCTAT T6CCTCAGAG CACAGCGGGA AGGACAAGGA TTGGCATATA AACTCACCCA 780 
TTCAAGCCAC CAACAGCAAC CCCCTTTCGG TCCCCTCCCA TTGTATGGGA GCTCTGTTTT 840 
CACTCTATTT CACTCTATTA AATCATGCAA CTGCACTCTT CTG GTCCGTG TTTTTTATGG 900 
CTCAACCTCA GCTTTTGTTC GCCATCCAGC ACTGCTCTTT GCCACCGTCA CAGACCCGCT 960 
GCTGACTTCC ATCCCTTTGG ATCCAG C AG A GTGTCCACTG TGCTCCTGAT CCAGCGAGGT X020 
ACCCATTGCC ACTCCCCATC AGG CTAAAGG CTTGCCATTG TTCCTGCATG GCTAAGTC CC 1080 
TGGGTTTGTC CTAATAGAAC TGAACACTCC TCACTCCCTT CCATGGTTCT CTTCCATGAC 1140 
CCACGGCTTC TAATACACCT ATAACACTCA CCCCATCCCC CAAGATTCCA TTCCTTCCTA 1200 
TCTGTCAGGC CAACAACCCC ACGTCACAGA ANGTGAGCCT TCCCACCATT TCGCAAGTGG 1260 
CCCACTGCCA TTTTGGTACC GCCCCACCAC CATCTTCCGA GCTGTGGGAG CAAGGATCCC 1320 
CCAGTAACA 

1329 

(2) INFORMATION FOR SEQ ID NO : 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 121: 
Gin Asn Arg Arg Ala Leu Asp Leu Leu Thr Ala Lys Arg Gly Gly Thr 
X 5 10 is 

Cys Leu Phe Leu Gly GIu Clu Cys Cys Xaa Tyr Val Asn Gin Ser Gly 

20 25 30 

He lie Thr Clu Lye Val Lys Clu lie Xaa Asp Arg xie Xaa Cya Arg 

35 40 as 

Ala Glu Asp Leu Gin Asn Thr Ala Pro Trp Gly Leu Leu Ser Gin Trp 

50 55 60 

Met Pro Trp Thr Leu Pro Phe Leu Gly Pro Leu Ala Ala lie lie Phe 
65 70 75 80 

l*eu Leu Leu Phe Gly Pro Cya He Phe Aan Phe Leu Val Lys Phe Val 

SS 90 95 

Ser Ser Arg He Clu Ala Val Lys Leu Gin He Val Leu Cln Het Clu 
100 105 no 
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Pro Gin Met Gin Ser Met Thr Lys He Tyr Arg Gly Pro Leu Asp Arg 

115 120 125 

Pro Ala Arg Leu Cys Ser Asp Val Asn Asp He Clu Val Thr Pro Pro 

130 135 140 

Glu Clu lie Ser Thr Ala Gin Pro Leu Leu Hia Ser Asn Ser Val Gly 
145 150 155 160 

Ser Ser 

(2) INFORMATION FOR SEQ ID NO: 12 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 

5 (B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
10 GGCATTGATA GCACCCATCA G 21 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 

15 (B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 
2 0 CATGTCACCA GGGTGGAATA G 21 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: base pairs 

25 (B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
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(2) INFORMATION FOR SEQ ID NO : 12 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 58 base pairs 

(B) TYPE: nucleotide 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

GCCATTCATA CCACCCATCA CATGCCCAAA TCATTAT XT A CTGGACCAGC CCTTT T CAAA 60 

ACTATCAACC AGATAGGGCC CGTGAAGCAT CCCAAACAAA TAATCCCCTG CCTTATCCCC 120 

ATCTTCCTTC ACGAGAACAA ACAACACGCC ATTACCCAGG GGAAGACTGG CAACTAGATT 180 

TTACCCACAT GGCCAAATGT CAGGGATTTC AG CATCT ACT AGTCTCCGCA GATACTTTCA 240 

CTGGTTGGGT GGAGTCTTCT CCTTGTAGGA CAGAAAAGAC CCAAGAGGTA ATAAAGGCAC 300 

TAATGAAATA ATTCCCAGAT TTCCACTTCC CCCACCATTA CACCGTGACA ATCCCCCCCC 360 

TTTCAACGCT CCACTAACCC ACCCACTATC CCACCTCTTA GCCATACAAT ATCACTTACA 420 

CTCTGCCTGG ACCCCACAAT CCTCCACAAA AGTCAAGAAA ATGAATGAAA CACTCAAAGA 480 

TCTAAAAAAG CTAACCCAAC AAACCCACAT TGCATGACCT GTTCTGTTGC CTATAACCTT 540 

ACTAAGAATC CATAACTATC CCCCAAAAAG CAGGACTTAG CCCATACGAO ATGCTATATG 600 

GATCGCCTTT CCTAACCAAT GACCTTGTGC TTGACTGAGA AATGGCCAAC TTAGTTGCAG 660 

ACATCACCTC CTTAGCCAAA TATCAACAAG TTCTTAAAAC ATCACAGGGA ACCTCTCCCC 720 

GAGAGGAGGG AAAGGAACTA TTCCACCCTG GTGACATG 75 B 

10 (2) INFORMATION FOR SEQ ID NO : 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
CGGACATCCA AAGTGATGGG AAACG 2 5 

2 0 (2) INFORMATION FOR SEQ ID NO : 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
2 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
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GGACAGGAAA GTAAGACTGA GAAGGC 2 6 

(2): INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 26 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

CCTAGAACGT ATTC TGG AG A ATTGGG 2 6 

(2) INFORMATION FOR SEQ ID NO: 12 9: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

TGGCTCTCAA TGGTCAAACA TACCCG 2 6 

(2) INFORMATION FOR SEQ ID NO : 13 0: 

(i) SEQUENCE CHARACTERISTICS: 

2 5 (A) LENGTH: 1511 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

CCTAGAACGT ATTCTGGAGA ATTGCGACCA ATCTGACACT CAGACGCTAA GAAAGAAACG 60 
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ATTTATATTC TTCTCCACTA CCCCCTCCCC ACAATATCCT CTTCAACGCA CACAAACCTC 120 
GCTTCCTCAC CCAACTATAA ATTATAACAT CATCTTACAC CTAGACCTCT TCTCTACAAA 100 
OMCOOCAAA- TCCACTCAAC TCCCATATCT GCAAACTTTC TTTTCATTAA G AG ACAACTC 240 
ACAATTATGT AAAAAGTCTG GTTTATGCCC TACAGGAAGC CCTCAGAGTC CACCTCCCTA 300 
CCCCAGCGTC CCCTCCCCGA CTCCTTCCTC AACTAATAAG GACCCCCCTT TAACCCAAAC 360 
GGTCCAAAAG GAGATAGACA AAGGGGTAAA CAATGAACCA AAGAGTGCCA ATATTCCCCG 420 
ATTATGCCCC CTCCAAGCAC TCAGAGCAGG AGAATTCCCC CCAGCCAGAG TGCCTGTACC 480 
TTTTTCTCTC TCAGACTTAA AGCAAATTAA AATAGACCTA GGTAAATTCT CAGATAACCC 540 
TGACCCCTAT ATTGATGTTT TACAAGGGTT AGGACAATCC TTTGATCTCA CATCGAGAGA 600 
TATAATCTTA CTACTAAATC AGACACTAAC CCCAAATGAC AGAACTGCCG CTGTAACTGC 660 
ACCCCGACAG TTTCGCCATC TTTCGTATCT CAGTCAGGCC AACAATAGGA TGACAACAGA 720 
GCAAAGAACA ACTCCCACAG CCCAGCAGGC ACTTCCCAGT GTACACCCTC ATTGGGACAC 780 
AGAATCAGAA CATGGAGATT GGTCCCACAA ACATTTC CTA ACTTCCGTGC TAGAAGGACT 840 
GAGGAAAACT AGGAAGAAGC CTATGAATTA CTCAATGATG TCCACTATAA CACAGGG AAA 900 
GGAAGAAAAT CTTACT G CTT TTCTGGACAG ACTAAGGGAG GCATTGAGGA AGCATACCTC 960 
CCTGTCACCT GACTCTATTG AAGGCCAACT AATCTTAAAG GATAAGTTTA TCACTCAGTC 1020 
ACCTCCAGAC ATTAGAAAAA ACTTCAAAAG TCTGCCTTAG GCCCGGAGCA GAACTTAGAA 1080 
ACCCTATTTA ACTTC6CATC CTCAGTTTTT TATAATACAC ATCAGGAGGA GCAGGCGAAA 1140 
CGGGACAAAC GGCATAAAAA AAAAAGGGGG GGTC C ACT AC TTTAGTCATG GCCCTCAGGC 1200 
AAGCAGACTT TGGAGGCTCT GCAAAAGGGA AAAGCTGGGC AAATCAAATG CCTAATAGGG 1260 
CTGG CTTCCA GTGCGGTCTA CAAGGACACT TTAAAAAAGA TTATCCAAGT AG AAAT AAC C 1320 
CGCCCCCTTG TCCATGCCCC TTACGTCAAG GGAATCACTC GAAGGCCCAC TGCCCCAGGG 1380 
GATGAAGATA CTCTGAGTCA GAAGCCATTA ACCAGATGAT CCAGCAGCAG CACTGAGGCT 1440 
GCCCGGGGCG ACCCCCAGCC CATGCCATCA CCCTCACAGA GCCCCGGGTA TGTTTGACCA IS 00 
TTGAGAGCCA A 1S11 

(2) INFORMATION FOR SEQ ID NO : 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 131: 
Lou Clu Arg lie L*u Glu A<n Trp Asp Gin Cys Asp Thr Gin Thr Lsu 

15 10 IS 
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Arg Lys Lys Arg Phe lie Phe Phe Cys Ser Thr Ala Trp Pro Gin Tyr 

20 25 30 

Pro Leu Gin Gly Arg Glu Thr Trp Leu Pro Glu Gly Ser He Asn Tyr 

35 40 45 

Asn Ha He Leu Gin Leu Asp Leu Phe Cys Arg Lys Glu Gly Lys Trp 

SO 55 60 

Ser Glu Val Pro Tyr Val Gin Thr Phe Phe Ser Leu Arg Asp Asn ser 
65 70 75 80 

Gin Leu Cys Lys Lys Cys Gly Leu Cys Pro Thr Gly Ser Pro Gin Ser 

85 90 95 

Pro Pro Pro Tyr Pro Ser Val Pro Ser Pro Thr Pro Ser Ser Thr Asn 



Lys Asp Pro Pro Leu Thr Gin Thr Val Gin Lys Glu He Asp Lys Gly 



Val Asn Asn Glu Pro Lys Ser Ala Asn He Pro Arg Leu Cys Pro Leu 

130 135 140 

Gin Ala val Arg Gly Gly Glu Phe Gly Pro Ala Arg Val Pro val Pro 
145 150 155 160 

Phe Ser Leu Ser Asp Leu Lys Gin He Lys He Asp Leu Gly Lys Phe 

165 170 175 

Ser Asp Asn Pro Asp Gly Tyr He Asp Val Leu Gin Gly Leu Gly Gin 

180 105 190 

Ser Phe Asp Leu Thr Trp Arg Asp He Met Leu Leu Leu Asn Gin Thr 

195 200 205 

Leu Thr Pro Asn Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu Phe 

210 215 220 

Gly Asp Leu Trp Tyr Leu Ser Gin Ala Asn Asn Arg Met Thr Thr Glu 
225 230 235 240 

Glu Arg Thr Thr Pro Thr Gly Gin Gin Ala Val Pro Ser Val Asp Pro 

245 250 255 

Hie Trp Asp Thr Glu Ser Glu His Gly Asp Trp Cys Hie Lys His Leu 

260 265 270 

Leu Thr Cys Val Leu Glu Gly Leu Arg Lye Thr Arg Lye Lye Pro Met 

275 280 285 

Asn Tyr Ser Met Met Ser Thr He Thr Gin Gly Lys Glu Glu Asn Leu 
290 295 300 



100 



105 



110 



115 



120 



125 
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Thr Ala Phe Leu Asp Arg Leu Arg Glu Ala Leu Arg Lys His Thr Ser 

305' 310 315 320 

Leu Ser Pro Asp Ser lie Glu Gly Gin Leu lie Leu Lys Asp Lys Phe 

325 330 335 

5 lie Thr Gin Ser Ala Ala Asp lie Arg Lys Asn Phe Lys Ser Leu Pro 

340 345 350 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 30 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

TGCTGGAATT CGGGATCCTA GAACGTATTC 30 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 30 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

AGTTCTGCTC CGAAGCTTAG GCAGACTTTT 30 

(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 398 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro 
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1 5 10 15 

Arg Gly Ser Hia Met Ala Ser Mec Thr Gly Gly Gin Gin Met Gly Arg 

20 " 30 

lie Leu Glu Arg He Leu Glu Asn Trp Asp Gin Cys Asp Thr Gin Thr 

35 40 45 

Leu Arg Lys Lys Arg Fhe He Phe Phe Cys Ser Thr Ala Trp Pro Gin 

50 55 60 

Tyr Pro Leu Cln Gly Arg Clu Thr Trp Leu Pro Clu Gly Ser He Aen 
65 70 75 80 

Tyr Asn I la He Leu Gin Leu Asp Leu Phe Cys Arg Lys Glu Gly Lys 

85 90 95 

Trp ser Glu Val Pro Tyr Val Gin Thr Phe Phe Ser Leu Arg Asp Asn 

100 105 110 

Ser Gin Leu Cys Lys Lys Cys Gly Leu Cys Pro Thr Gly Ser Pro Gin 

"5 120 X25 

Ser Pro Pro Pro Tyr Pro Ser Val Pro Ser Pro Thr Pro s«r Ser Thr 

130 135 140 

Asn Lys Asp Pro Pro Leu Thr Gin Thr Val Gin Lys Glu He Asp Lys 
145 ISO 155 160 

Gly val Asn Asn Glu Pro Lys ser Ala Asn He Pro Arg Leu Cys Pro 

165 170 175 

Leu Gin Ala Val Arg Gly Gly Glu Phe Gly Pro Ala Arg Val Pro Val 

ISO IBS 190 

Pro Phe Ser Leu Ser Asp Leu Lys Cln He Lys He Asp Leu Gly Lys 

1^5 200 205 

Phe Ser Asp Asn Pro Asp Gly Tyr He Asp Val Leu Gin Gly Leu Gly 

2i0 215 220 

Gin ser Phe Asp Leu Thr Trp Arg Asp He Met Leu Leu Leu Asn Gin 
225 230 235 240 

Thr Leu Thr Pro Asn Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu 

245 250 255 

Phe Gly Asp Leu Trp Tyr Leu Ser Gin Ala Asn Asn Arg Met Thr Thr 

260 265 270 

Clu Clu Arg Thr Thr Pro Thr Gly Gin Gin Ala Val Pro Ser Val Asp 

27S 280 285 

Pro His Trp Asp Thr Glu Ser Glu His Gly Asp Trp Cys His Lye His 
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290 295 300 

Leu Leu Thr cys Val Leu Glu CXy Leu Arg Lys Thr Arg Lys Lys Pro 
305 310 315 320 

Met Asn Tyr Ser Met Met Ser Thr lie Thr Gin Gly Lys Glu Glu Aan 

325 330 335 

Leu Thr Ala Phe Leu Asp Arg Leu Arg Glu Ala Leu Arg Lys His Thr 

340 345 350 

Ser Leu Ser Pro Asp Ser lie Glu Gly Gin Leu lie Leu Lys Asp Lys 

355 360 365 

Phe lie Thr Gin Ser Ala Ala Asp lie Arg Lys Asn Phe Lys Ser Leu 

370 375 380 

Pro Lys Leu Ala Ala Ala Leu Glu His His His His His His 
385 390 395 

(2) INFORMATION FOR SEQ ID NO : 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 7: 
Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg lis Lou Glu Arg 
15 10 15 

He Leu Glu Asn Trp Asp Gin Cys Asp Thr Gin Thr Leu Arg Lys Lys 

20 25 30 

Arg Phe He Phe Phe Cys Ser Thr Ala Trp Pro Gin Tyr Pro Leu Gin 

35 40 45 

Cly Arg Glu Thr Trp Leu Pro Glu Gly Ser He Asn Tyr Asn He tie 

50 55 60 

Leu Gin Leu Asp Leu Phe Cys Arg Lys Glu Gly Lys Trp Ser Glu Val 
65 70 75 00 

Pro Tyr Val Gin Thr Phe Phs Ser Leu Arg Asp Asn Smr Gin Lsu Cys 

85 90 95 

Lys Lys Cys Gly Leu Cys Pro Thr Gly Ser Pro Gin ser Pro Pro Pro 
100 105 110 
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Tyr Pro Ser Val Pro Ser Pro Thr Pro Ser ser Thr Asn Lys Asp Pro 

115 120 125 

Pro Leu Thr Gin Thr Val Gin Lys Glu lie Asp Lys Gly Val Asn Aon 

130 135 140 

Glu Pro Lys Ser Ala Asn He Pro Arg Leu Cys Pro Leu Gin Ala Val 

145 150 155 160 

Arg Gly Cly Glu Phe Gly Pro Ala Arg Val Pro Val Pro Phe Ser Leu 

165 170 175 

Ser Asp Leu Lys Gin He Lys lie Asp Leu Cly Lys Phe Ser Asp Aan 



Leu Thr Trp Arg Asp He Met Leu Leu Leu Asn Gin Thr Leu Thr Pro 

210 215 220 

Asn Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu Phe Gly Asp Leu 
225 230 235 240 

Trp Tyr Leu Ser Gin Ala Asn Asn Arg Met Thr Thr Glu Glu Arg Thr 

245 250 255 

Thr Pro Thr Gly Gin Gin Ala Val Pro Ser Val Asp Pro His Trp Asp 

260 255 270 

Thr Glu Ser Glu His Gly Asp Trp Cys His Lys His Lau Leu Thr Cys 

275 260 285 

Val Leu Glu Gly Leu Arg Lys Thr Arg Lys Lys Pro Met Asn Tyr Ser 

290 295 300 

Met Met Ser Thr He Thr Gin Cly Lys Glu Glu Asn Leu Thr Ala Phe 
305 310 315 320 

Leu Asp Arg Leu Arg Glu Ala Leu Arg Lys His Thr Ser Leu Ser Pro 

325 330 335 

Asp Ser He Glu Gly Gin Leu He Leu Lys Asp Lys Phe He Thr Gin 

340 345 350 

Ser Ala Ala Asp He Arg Lys Asn Phe Lys Ser Leu Pro Lys Leu Ala 

35S , 360 365 

Ala Ala Leu Glu His His His His His His 
370 375 

(2) INFORMATION FOR SEQ ID NO: 13 8: 
(i) SEQUENCE CHARACTERISTICS: 



180 185 190 

Pro Asp Cly Tyr He Asp Val Leu Gin Gly Leu Cly Gin Ser Phe Asp 
195 200 205 



5 



(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 



REPLACEMENT SHEET (RULE 26) 



- 18 - 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 8: 
CTTGGAGGGT GCATAACCAG GGAAT 2 5 

5 

(2) INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleotide 

10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 9: 
TGTCCGCTGT GCTCCTGATC 2 0 

15 

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleotide 

2 0 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 140: 
CTATGTCCTT TTGGACTGTT TGGGT 2 5 

25 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 64 base pairs 

(B) TYPE: nucleotide 

3 0 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 
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TGTCCGCTGT CCTCCTGATC CACCACAGGC CCCCATTCCC TCTCCCAATT CCCCTAAACG 60 

CTTGCCATTC TTCCTCCACA GCTAACTCCC TCCCTTCATC CTAATCCACC TCAACACTAC 120 

TCACTCGGTT CCXCCGTTCT CTTCCATGAC CCATCCCTTC TAATXCACCT ATAACACTCA ISO 

CICCATCCTC CAAGATTCCA TTCCTTGCAA TCCGTG AG AC CAAGAACCCC AGGTCAGAGA 240 

ACACAAGGCT TGCCACCATG TTCGAAGCAG CCCACCACCA TTTTGCAACC AGCCCGCCAC 300 

TATCTTGGGA GCTCTGGCAG CAAGGACCCC AGGTAACAAT TTGGTGACCA CGAAGGGACC 360 

TCAATCCCCA ACCATGAAGG GATCTCCAAA GCAATTGGAA ATGTTCCTCC CAAGGCAAAA 420 

ATGCCCCTAA GATGTATTCT CGAGAATTGG GACCAATTTG ACCCTCACAC AGTAAGAAAA 480 

AAATGACTTA TATTCTTCTG CAGTACCGCC CTGCCCACGA TATCCTCTTC AAGGGGGAGA S40 

AACCTGGCCT CCTCACGGAA GTATAAATTA TAACACCATC TTACACCTAC ACCTCTTTTC 600 

TAGAAAACCA CGCAAATGGA GTGAACTGCC ATATTTACAA ACTTTCTTTT CATTAAAAG A 660 

CAACTCGCAA TTATCTTAAC AGTGTGATTT GTGTTCCTAC ACGGAAGCCC TCAGATTCTA 720 

CTCCCCACCC CCGCCATCTC CCCTCAATCC CTCCCCAACT TATT 754 

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

TGTCCCCTGT OCTCCTGATC CAGCACAGCC GCCCATTGCC TCTCCCAATT GGGCTAAAGG 60 

CTTCCCATTG TTCCTCCACA GCTAACTCCC TGGGTTCATC CTAATCGAGC TGAACACTAC 120 

TCACTOGGTT CCACGGTTCT CTTCCATGAC CCATGGCTTC TAATAGACCT ATAACACTCA 180 

CTGCATGGTC CAAGATTCCA TTCCTTGCAA TCCGTG ACAC CAAGAACCCC AGGTCAGAGA 240 

ACACAAGGCT TCCCACCATO TTCCAACCAC CCCACCACCA TTTTCCAAGC GCCCCCCCAC 300 

TATCTTGGGA CCTCTGOGAO CAAGGACCCC CACCTAACAA TTTCCTCACC ACGAAGCGAC 360 

CTCAATCCOC AACCATCAAG GGATCTCCAA AGCAATTGGA AATGTTCCTC CCAAGGCAAA 420 

AXTOCCCCTA AGATGTATTC TGGAGAATTG GGACCAATCT GACCCTCAGA CAOTAAGAAA 480 

AAAAATOACT TATATTCTTC TGCAGTACCG CCTGGCCACG GATATCCTCT TCAAGOOOGA 540 

OAAACCTOGC CT C CT G AGGC AAGTATAAAT TATAACACCA TCTTACAOCT AGACCTGTTT 600 

TGTAOAAAAG GAGOCAAATG GAGTGAAGTG CCATATTTAC AAACTTTCTT TTCATTAAAA 660 

GACAACTCGC AATTATGTAA ACAGTCTGAT TT G T C T C CTA CACGAAGCCC TCAGATCTAC 720 

CTCCCTACCC CCCCATCTCe CTGACTCCTT CCCCAACTAA TAAGGACCCA CTTCACCCCA 780 



AACAGTCCAA AAGGACATAG 



800 
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10 20 30 40 50 

1234567890 1234567890 1234567890 1234567890 1234567890 

COTIGTAICT TTAAGCTU1T 'lUl'^AAGTIT CTCICITCCA GAA1CAAAAC 50 
PCIF NLL VKF VSSR IKT 
PVS LTSL LSL SLP ESKL 
LYL .PP C.VC LFQ NQN 

TCIAAAACTA. CAAATIGTTC TTCAAATGGA QCACCAGATG GAGTCCATGA 100 
VKL Q I V L QME HQ M ESMT 
. NY KLF FKWS TRW SP. 
CKTTNCS SNGAPDGVHD 

CTAAGATOCA 00GIGGAO3C CIGGAQ333C CTGCTAGCCC ATGCTCCGAT 150 

KIH RGP LDRP ASP CSD 
LRST VDP WTG L LAH APM 
. DP PWTP GPA C.P MLRC 

GTTAATGACA TTGAAGGGAC OGCIDCX3GAG GAAATCICAA CIQCACAAQC 200 
VNDI EGT PPE EIST AQP 
LMT LKAP LPR KSQ LHN P 
. .H .RH PSRG NLN CTT 

CCEACTA3GC CCCAATICAG GG3GAAGCAG TEAGAGCX3GT CATCAGCCAA 250 
LLC PNSA GSS .SG HQPT 
YYA PIQ REAV RAV ISQ 
PTMP QFS G KQ LERS SAN 

CCIDOCCAAC AGCACTTGGG TmUL ' lUIT GAGAGGGGGG ACTGAGAGAC 300 

SPT ALG FSC. EGG LRD 
PPQQ HLG FPV ERGD .ET 
LPN STWV FLL RGG TERQ 

AGGACTBGCT GGAITIUL'IA GGCCAAOGAA GAATCCCEAA QCCIBGCIGG 350 
RTSW IS. ANE ESLS LAG 
GLA GFPR PTK NP. A.LG 
D.L DFL GQRR IPK PSW 
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PTN QRAH . M A N.A K I G G 



GTAAAGAAAT AGCCAATCAT CEATIGOCTG AGAQCACAGC G3GAG3GACA 

VK oV PI1 YCL RAQR E G Q 
SQSS I A . EHS GRD K 
KEI ANH LLPE STA GGT 



AGSATO33GA TATAAACD2A. GGCATTCGAG CGGGCAACGG CAACCCCCIT 
GSG YKPR HSS RQR Q PPL 
DRD INP GIRA GNG N P L 
RIGI • T Q AFE PATA TPF 



1GGGTOCXXT CCL'mUiAT GGGCGCICIG TITICACICT ATTICACICr 
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10 20 30 40 50 

4567890 1234567SQn 12345fi7RQn 12345fi7RQn I2345fi7flqn 
KIQGCCeVCC CTIATCATAC T1T1U1L.TJ.T ACIGTTCTCT TACCCCCTTT 
MALP YHT FLF T V L L PPF 
WPS LIIL FSL LFS 
GPP LSY FSLY C 
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TCATATAGGA GTITATCTAA GGGAAACTQC AOCTTCACTG CCCACACOCA 
SYRS LSK GNS TFTA HTH 
HIG VYLR ETP PSL PTPI 
I ' • E F I . GKLH LHC PHP 

TATGOCXXX3C AACIGCTATA ACICIGOCAC TCTTTGCATG CATGCAAATA 
MPR NCYN SAT LCM 
CPA TAI TLPL FAC 
YAPQ L L LCH SLH 
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CTCATTATTG GACAGGGAAA ATGATTAATC CTAGTIGTCC TGGAGGACTT 

HYW TGK MINP SCP GGL 
LIIG QGK .LI LVVL EDL 
SLL DREN D.S .LS WRTW 

GGAGCCACTG TCIGTTGGAC TTACITCACC CATACCAGTA TGTUIGATGG 
G A T V CWT YFT HTSM SDG 
EPL SVGL TSP IPV CLMG 
SHC LLD LLHP YQY V .W 
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123 4567R90 1234 5678 90 1234567890 1234567890 1 ?3d567RQn 

ACXnCAOCIG TGTD^AAATTT AGCAATACEA. U^GACACAAC CAGCTCCCAA 75( 

LTC VKF SNTI DTT SSQ 
TSPV N L AIL . T Q P APN 

PHL CKI. QYY RHN QLPM 

TQCAICAQGT Q3GTCAACACC TO32ACA03^ ATAGICIGCC TACQCICAGG 80C 
CIRW VTP PTR IVCL PSG 
ASG G.HL PHE . S A YPQE 
HQV GNT SHTN SLP TLR 

AATALLTmT GICTCIGGm CCTCAQCCIA TCATIGTTIG AATGGCICIT 850 
IFF VCGT SAY HCL NGSS 
YFL S V V PQPI IV. MAL 
NIFC LWY LSL S LFE WLF 

CAGAAICEAT (JIULTIUL'IU TCATIUITAG TOOOOOCTAT GACCATCTAC 900 

ESM CFL SFLV PPM TIY 
QNLC ASS HS. CPL. PST 
RIY VLPL ILS APY DHLH 

ACIGAACAAG ATTTATACAA TCATCID3EA GC3AAQ0GCX: ACAACAAAAG 950 
TEQD LYN H V V PKPH NKR 
LNK IYTI MSY LSP TTKE 
• TR F I Q SCRT .AP QQK 

AGTAararr cnarrrnG teatcagagc aggagigcea gqcagaciag 1000 

VPI LPFV IRA GVL GRLG 
YPF FLL LSEQ EC. AD. 
STHS SFC YQS RSAR QTR 

GTACT03CAT TO3ZAGTATC ACAADCICEA CTCAGJL'IL'IA CTACAAACEA 1050 

TGI GSI TTST QFY YKL 
VLAL AVS QPL LSST TNY 
Y W H WQYH NLY SVL LQTI 
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TUICAAGAAA TAAAT3GTGA CATO^AACAG GICACIGACT OCUIGJ1CAC 1100 
SQEI NGD MEQ VTDS LVT 
LKK M V T WNR SLT PWSP 

SRN KW. HGTG H.L PGH 

CTIGCAAGAT CAACTIAACT UULVUAQCAGC AGTEAGTOCTIT CAAAATO3AA 1150 
LQD QLNS L A A VVL QNRR 
CKINLT P. QQ .SFKIE 
LARS T.L PSS SSPS KSK 

GAGCTITAGA LTlUL'UftACC QOCAAAAGAG GG3GAACCIG ' l ' l ' imTl'l ' iA 1200 

ALD LLT AKRG GTC LFL 
EL.T C.P PKE G.EPV Y F 
SFR LANR QKR GNL FIFR 

GGAGAAGAAC GJIUITATIA TUITAAICAA. TOZAGAATIG TCACIGAGAA 1250 
GEER CYY VNQ SRIV TEK 
EKN AVIM LIN PEL SLRK 
RRT LLL C.SI QNC H.E 

AGTEAAAGAA ATIOGAGATC GAATACAAIG TAGAGCAGAG GA G L TIUA AA 1300 
VKE IRDR IQC RAE ELQN 
LKK FEI EYNV EQR SFK 
S.RN SRS NTM S R G ASK 

ACaOOGAftaS CIG3GG0CIC CICAGOCAAT GGATOOOCIG Um'lL'lULXJL' 1350 

TER WGL L SQW MPW VLP 
TPNA GAS SAN GCPG FSP 
HRT LGPP QPM DAL GSPL 

T1L.T12AGGAC C1LVUAGCAGC lUTAATATIG TEACIOTICr TIGaAQOCIG 1400 
FLGP LAA LIL LLLF GPC 
S.D L.QL .YC YSS LDPV 
LRT SSS SNIV TPL WTL 
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10. 20 30 40 50 
12 345678 90 1234567fl QQ 1234567890 1234567890 l?34567RQn 

TarcrnaAC cicuriuim agitigicic ticcagaatt gaagctgtaa 1450 

IFN LLVK FVS SRI EAVK 
SLT SLL SLSL PEL KL. 
YL.P PC. VCL FQN. SCK 

AGCIBCAGAT Q3ICTEACAA A3GGAACCCC A 1481 

LQM VLQ MEP 
SYRW SYK WNP 
ATD GLTN G T P 
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TCAAAATOGA AGAQCTTTAG ACTIGCEAAC CGCCAAAAGA G3GGGAACCT 50 
SKSK SFR LAN RQKR GNL 
QNR RALD LLT AKR GGTC 
K I E EL. TC.PPKEGEP 

ummTiTr agoqgaagaa tgcigteagt aigttaatca atciogaatc 100 

FIF RGRM LLV C.S IWNH 
LFL GEE CC.Y VNQ SGI 
VYF. GKN AVS MLIN LES 

ATEACTGAGA AAGTTAAAGA AATTIGAGAT CGAATAIAAT GTAGAGCAGA 150 

Y.E S.R NLRS NIM .SR 
ITEK VKE I . D RI.C RAE 
LLR KLKK FEI E YN VEQR 

GGACCTICAA AACACIGCAC CXTIGGGGGCT CCICAGGGAA 1GGAIQ0QCT 200 
GPSK HCT LGP PQPM DAL 
DLQ NTAP WGL L SQ WMPW 
TFK TLH PGA S SAN GCP 

GGACTCTQJL ' CriUl ' JA GGA CXJIUiAQCAG CTATAATATT TITACIOCriC 250 
DSP LLRT S SS YNI FT. PL 
TLP FLG PL A A IIF LLL 
GLSP S.D L.Q L.YF YSS 

TTTGGAOOCT GTATCITCAA CT1ULT1U1T AAGTITGTCT LTTUUA GAAT 300 

WTL YLQ LPC. VCL FQN 
FGPC I FN F LV KFVS SRI 
LDP VS ST SLL SLS LPEL 

TGAAGCIGIA AAGCIACAAA TAGTTCITCA AATGGAACCC CAGAIGCAGT 350 
. SCK ATN SSS NGTP DAV 
EAV KLQI VLQ MEP QMQS 
K L SYK F F K WNP RCS 
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FIG 5 (continued) 
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OIATGACIAA AATCEAGOGT GGAOXUIGS AO033CCIQC TAGACIAIGC 400 
HD. NLPW TPG P A C T M L 

MT K IYR GPLD RPA RLC 
P.LK STV DPW TGLL DYA 

TCIGAIUI'IA AT3ACATIGA AGICAGOCTT COOGAGGAAA ICICAACIQC 450 

.C. . H . SHPSRGNLNC 
SDVN DIE VTP PEEI STA. 
LML MTLK S PL PRK SQLH 

ACAACQGCIA CTACACIOCA ATTCAGT&Q3 AAGCAGTTAG AGCAUI'IUIL' 500 
TTPT TLQ FSR KQLE QLS 
QPL LHSN SVG S S . SSCQ 
NPY YTP IQ.E AVR AVV 

AQGCAAGCIC CCCAACAGTA CTIGGGTITT GCIGTIGAGA GG3IGGACIG 550 
ANL PNST WVF L L R GWTE 
PTS PTV LGFS C.E GGL 
SQPP QQY LGF PVER VD. 

AGAGACAGGA CEAGCTQ3AT TIUL'-UAGGCT GACIAAGAAT GOCNAAQQCT 600 

RQD L D FLG. L R I PKP 

RDRT SWI S.A D.ES X S L 
ETG LAGF PRL TKN PXAX 

ANCIG3GAAG GTCACCX3ZAT CCAMLTl'iAA ACAIGGGGCT TGCAACTEAG 650 
XWEG DRI H L TWGL QLS 

XGK VTAS IFK HGA CNLA 
LGR .PH PSLN MGL AT. 

CTCaCftGOOS AOCAATCAGA GAGCICACTA. AAATGCTAAT CAQ3CAAAAA 700 
SHP TNQR AH. NAN QAKT 
HTR PIR ELTK MLI RQK 
LTPD QSE SSL KC.S GKN 
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CAGGAGGTAA AGCAATAOOC AATCATCEAT TGOCIGAGAG CACAGGQ3GA _ 

G G K A I A N H L L PES TAG 
Q EVK Q . p i IY CLRA QR E 
R R . SNSQ SSI A . E HSGK 



AGGACAAGGA TTG3GATATA AACTCAGGCA TTCAAQQCAG CAACAGCAAC 
R T R I G I . TQA FKP aTaT 
GQG LGYK LRH SS Q Q Q q p 
DKD W D I NSGI QAS N S N 



03CCITTGG3 TCCCCTCCCA TIGTATGGGA GCICIGTITT CACICffiTIT 
PFG SPPI VWE L 
PLG PLP LYGS 
PLWV P SH CMG 
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HPP LLF ATVT 
QAE LLF AIHH CCL 



P P S 



CAGAoocxarr g^igacttoc atoocttigg atocagcaga gtgiocacig 

RPA ADFH PFG S 
D P L L T S 
Q T R C .LP 



S R V S T V 
IPLD PAE CPL 
SLW IQQS VHC 



TGCICCIGAT OCAGOGAGGT AOOCATTGO: ACIGCGGATC AGGCIAAAGG 

L L I QRG THCH 
C S . S S E V P 



S R S G . R 

IATPDQAKG 
A P D PARY PLP 



L P I R L K A 



750 



800 



850 



900 



950 



1000 



1050 



WO 99/02666 



PCT/FR98/01460 



12 / 32 
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CTTQDCATIG TTOCT3CATG QCTAAGIGOC T33 J1T1U1 C CIAAIAGAAC 
L A I V PAW L S A WVCP 
L P L FLHG .VP GF 
CHC SCM AKCL 



N R T 
V L I E L 
G L S N 



TGAACACTG3 TCACTGGGTT CTAIQJ l 'l LT CTICCAIGAC CCACQGCITC 
EHW SLGS MVL FHD PRLL 
NTG HWV PWFS SMT HGF 
T L V TGF HGS LP.p TAS 

TAATAGAGCT ATAACACICA OOXATGGGC CAAGATICCA T I C CT1LUIA 

IEL . H S PHGP RFH SLV 
• . S Y NTH RMA Q DSI PWY 
NRA ITLT AWP KIP FLGI 

TCIGIGAGGC CAAGAACOCr AGGTCAGAGA ANGTGAGGCT TGCGACCATT 
SVRP RTP GQR X.GL PPF 
L • G QEPQ VRE X E A CHHL 
CEA KNP RSEX VRL ATI 

TGGGAAGIGG CCCACIQ02A. TITIGGTAGC GGOOCACCAC CATCITG3GA 
GKW PTAI L V A AHH H 
GSG PLP FW.R PTT 
WEVA HCH FGS GPPP 
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GCTGlUJLaAG CAAGGATGOC GCAGTAACA 

CGS KDP PVT 
A V G A RIP Q. 
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ATICTOGAGA ATItX3GACCA ATOTOACACT CftGACOCTAA 50 
PRTY SGE LGP M.HS DAK 
L E R I L E N WDQ CDT QTLR 
• NV FWR IGTN V T L RR. 

GAAAGAAAQ3 ATTEATATIC TICIGCMTA OOGCCILUJC: ACAAEATCCT 100 
KET IYIL LQY RLA TISS 
KKR FIF FCST AWP Q Y P 
ERND LYS SAV PPGH NIL 

CTTCMG33A GAGAAACCTO ULT1UJ11JAG OSAA7TATAA ATTATAA2AT 150 

SRE RNL AS.G KYK L . H 
LQGR ETW LPE G S I N YNI 
F K G EKPG FLR EV. I I T S 

CAICTITOG CmGftULULT TCIGTAGAAA GGA03C32AAA TQGTCIGMG 200 
HLTA RPL L . K GGQM E.S 
ILQ LDLF CRK EGK'WSEV 
SYS .TS SVER RAN GVK 

TOCXZA3MCT GOAAZETIC TTTICATTAA GftGAO^ACIC AZAAITATOT 250 
AIC ANFL FIK R Q L TIM. 
PYV QTF F S L R DNS Q L C 
CHMC KLS F H . ETTH NYV 

AAAAAGIGIG G'lTlAiULXJC TACAG3AMC CCTCSGSGIC CA UL ' lUUL ' lA 300 

KVW FMP YRKP SES TSL 
KKCG LCP TGS P QSP PPY 
KSV VYAL QEA L R V HLPT 

<Y \ l- M GOUlt: OXTODOXA CIULTlUL'll' AACIAAIAA3 ^t|t t n T1 » 350 
PQRP L P D SFL N . . G PPF 
P S V PSPT PSS TNK DPPL 
PAS PPR LLPQ LIR TPL 

TMOD^AAC G3XOAAAAG GBGKEfiGfiCA AfiGGGCTAAA CAATCZWXA 400 
NPN GPKG DRQ RGR Q.TK 
TQT VQK E I D K GVN NEP 
P K R SKR R . T KG.T MNQ 

AAGfiGIQOCA ATOTIOOOCn Al'llAlUULDC CIGGAM3CK3 1G&GAGGM33 450 

ECQ YSP IMPP PSS ERR 
KSAN IPR LCP LQAV RGG 
RVP IFPD YAP SKQ E E E 

lUX'lUl^CC TnTICICTC TCTGRCTEAA 500 
RIRP SQS ACT FFSL RLK 
EFG PARV PVP FSL SD LK 
NSA QPE CLYL FLS QT. 
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AGCAAATTAA AATAGAGCTA OCTAAATICr TCA033CTAT 550 

AN. NRPR . I l R.P RLY 
QIK I D L GKFS DNP DGY 
SKLK . T . VNSQITLTAI 



600 



ATTCATCTIT TPCAP03JTT A3GACAATCC TTIGATCIGA CAIQ3AGAGA 

. C F TRV R T I L . S D MER 
IDVL Q G L GQS FDLT WRD 
L M F YKG. DNP LI. HGEI 

TATAATCTTA CTACTAAATC AGACACEAAC CQZAAATOftS A3AAGIGC0G 650 
YNVT TKS DTN PK.E KCR 
IML LLNQ TLT PNE RSAA 
•CY Y.I RH.P QMR EVP 

CTCTAACIGC AO0003AGAG TTK3C30GATC ' ITlUj i aiLT CKJTCPGGCC 700 
CNC SPRV WRS LVS QSGQ 
VTA ARE FGDL WYL S Q A 
L.LQ PES LAI FGIS.VRP 

AACAAIAGGA TGACAACAGA QGAAMAWZA POTCOCPCPG G0CMCA3QC 750 

Q.D DNR GKNN SHR PAG 
NNRM TTE ERT TPTG QQA 
TIG • Q Q R KEQ LPQ ASRQ 

AGTiaaCAGr CTAGAOaCIX:' ATIQ3GACAC MAATCMAA CAT3G2GATT 800 
SSQC RPS LGH RIRT WRL 
VPS VDPH WDT ESE HGDW 
FPV . ' T L IGTQ NQN MEI 

***iujjaCAA ACRJITIULUA ALTIULUIUC TN3AMG&CT GAQSAAAACT 850 
VPQ TFAN LRA RRT EEN. 
CHK HLL TCVL EGL RKT 
G A T N IC. LAC K D . GKL 

AQGAAGAAQC CI^TOAATIA C1CAA3GATO TCradMBA CTQQG3AA 900 

EEA YEL LNDV HYN TG K 
RKKP MNY SMM ST IT QGK 
GRS L.IT Q.C PL. HRER 

OGAAGAAAAT CITOCIO£IT TICIGGZOtf* POI&OGGPG Q3\TIGAGGA 950 
GRKS YCF SGQ TKGG IEE 
EEN LTAF LDR LRE ALRK 
KKI LLL FW.TD .GR H.G 

A3CATAOOTC CL'lU'lUfcJLT CTCICTATIG AAG30CAACT AATCTUAAG 1000 
A Y L PVT. LY. RPTNLKG 
HTS LSP DSIE GQL ILK 
SIPP CHL TLL KAN. S.R 
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GATPiPCTTTA TCACIC&GIC AOCTOCAGAC ATEH3AAAAA PCTVCMAPG 1050 

• VY HSV SCRH K K LQK 

DKFI TQS A A D IRK N FKS 
ISL SLSQ LQT L E K TSKV 

TCTGCCTEAG GOCD33ftOCA GAACTEPGAA A UULTK1T1A ACTTO32A3C 1100 
SALG PEQ NLE TLFN LAS 
LP. ARSR T.K P Y L TWHP 
CLR PGA E LRN PI. L G I 

CTGACTTTTr TATAATAG^G ATCMGMGA GCAG30GAAA OGQGACAAAC 1150 
SVF YNRD QEE QAK RDKR 
QFF IIE IRRS RRN GTN 
LSFL . .R SGG AGET GQT 

Q3GATAAAAA AAAAAG333G GGIOZACEAC TTTAGICATO QOCCI Cfi OQC 1200 

DKK KRG GPLL . S W PSG 
GIKK KGG VHY FSH.G PQA 
G.K KKGG STT LVM ALRQ 

AAGCftG&CIT TO3MQCICT GCAAAMGGA AAfiG£7IQGG2 AAATCAAATO 1250 
KQTL EAL QKG KAGQ IKC 
SRL WRLC KRE KLG KSNA 
ADF GGS AKGK SWA NQM 

CdAAIAGaG C1UJLT1UJA GIQCTOICEA 0^fiG30VCT TEAAAAAW3V 1300 
LIG LASS A V Y KDT LKKI 
. .G WLP VRST RTL .KR 
PNRA GFQ CGL QGHF KKD 

TraroCAMTT A3AAATAA3Z OJLUJULT1U TXAJQUr TTR03IC3W3 1350 

IQV EIS RPLV HAP YVK 
LSK. K . A A P L SMPL TSR 
YPS R N K P PPC PCP LRQG 

GGAAIUAL'IU GMG300C2C TO OOOC M OG GA3GMGA2A CTC1U&3ICA 1400 
GITG RPT APG DEDT LSQ 
ESL EGPL P Q G MKI L . V R 
N HW KAH CPRG R Y SES 

GMaOCATlA AQ3GATCAT GBCnMGCT UUJUJLJJUL; 1450 

KPL TR.S SSR TEG ARGE 
SH. PDD PAAG LRV PGA 
EAIN QMI QQQ D.GC PGR 

A3GQ0raGGC CATOGCATCA CIX'ICAQGA GG3GGQG?IA 'lU'lTlUACCA 1500 

RQP MPS PSQS PGY V.P 
SASP CHH PHR APGM FDH 
A P A H A I T LTE PRV CLTI 
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FIG 6 (continued) 



10 20 30 40 50 

1234567RQQ !234Sfi7flQ Q 1234567890 1234Sfi7PQn i2Wfn*<m 

TIGBCftODCA A T^TT 

L R A XJ - 



E P 
ESQ 



WO 99/02666 



PCT/FR98/01 



17 / 32 
FIG 7 
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M 1 '? 3 ^ QO^MCAICA TCATCATCAC AGCAGOGGQC TOSIGOOGSg" 
MGSS HHH HHH SSGL VPR 

ATCGC^GCA. TGACIOGTO3 ACAQCAAATC QCTTCQGATCC 
MASM TGG QQM G R I L 

TAGAACCTAT TUTO3AGAAT TO33A02AAT G1GACACTCA GAGGCEAAGA 
LEN WDQ C DTQ TLR 

AAGAMCGAT TTATATTCIT CIGCfiGEAOC OrTOGCCSC AATATOOICT 
IFF CST AWPQ YPL 

TCAAQGGAGA GAAACTTOGC TIOCnGAGGG AA3EAIAAAT mTAACAIC^ 
QGR ETWL PEG SIN Y N I I 

TCTTACAGCT AGAOCTCTIC TGEAGAAAGG A333CAAATC GAGTGAAGTG 
LQL DLF CRKE GKW SEV 

ocaxatoigc aaactticit ticateaaga gacaacicac aateatoeaa 

PYVQ TFF SLR DNSQ LCK 

AAAGTGIG3T TEATO33TEA CAGGAAGOCC TCAGASTCCA. OnOJL'IAGC 
KCG LCPT GSP QSP PPYP 

a^aoasira: ctudoogact ccitoicaa ceaataagga ntri -i . - irjA 

SV P SPT PSST NKD PPL 

AOCCAAAG33 TOZAAAAGGA GATAGAC^AA GG3GTAAACA A1GAACCAAA 
T QTV QKE IDK GVNN EPK 

G^ra32AAr ATKXra»T TATOCDGCCT COtfGCAGIG AGaGGAGGAG 
IPRL CPL Q A V RGGE 

AATlUiOZC AGCCAGAGIU OJlUiaULTl ' ITIL ' IUIL ' RJ AGACZTAAAG 
ARV PVPF SLS DLK 

C&AATIAAAA TAGAQ^TAGG TAAATICTCA GAEAACOCIG AD33CTAIAT 
QIKI DLG KFS DNPD GYI 

IGAivaiiTiA CAAGG3ITAG GACAATOCTT TCMCIGACA. TGGAGAGATA 
QGLG QSF DLT WRDI 

"EAATOmcr ACIAAATCAG AOCTAACCC CAAATCAGAG AAGIGCCQCT 
MLL LNQ TLTP NER SAA 

GiaACTOAG OTSASftGIT 1G303A3CIT TCUiMLl l* G1CAG3CCAA 
VTAA REF GDL WYLS QAN 
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FIG 7 (continued) 
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CAAm33ATC ACAACAGAGG AAAGAACAAC TCCCACA03C CAqSqqcS" 
NRM TTEE RTT PTG QQAV 

TiarflGTGT AGACCCTCAT T33GACACAG AATCAGAACA TGOAGATTOG 
PSV DPH WDTE SEH GDW 

TGOCACAAAC ATITGCTAAC TIUOJIUL'I A GAAGGACTGA GGAAAACTAG 
CHKH LLT CVL EGLR KTR 

GAAGAAGDCr ATGAATTACT CAATGATGTC CACTATAACA CAGGGAAAGG 
KKP MNYS MMS TIT QGKE 

AAGAAAATCr TACTGCTITT CTGGACAGAC TAAGGGAGGC ATIGAQGAAG 
ENL TAF LDRL REA LRK 

CATAGCIOr TGTCACCTGA CTCTATTGAA GGCCAACTAA' TCTTAAAGGA 
HTSL SPD SIE GQLI LKD 

TAAGTTTATC ACTCAGTCAG CTGCAGACAT TAGAAAAAAC TTCAAAAGTC 
K F I TQSA ADI RKN FKSL 

TGCCTAAGCT TGGGGCCGCA CIUGAGCAGC AGCACCACCA CCACTGAGAT 
PKL AAA LEHH HHH H.D 
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FIG 8 
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ATO3CEAGCA TGACTGGTGG ACAG2AAATC Q3ICGGATCC TAGAAOGEAT 50 
MASM TGG QQM GRIL ERI 

TCIGGAGAAT TG33ACCAAT G1GACACTCA. GACGCEAAGA AAGAAACGAT 100 
LEN WDQC DTQ TLR KKRF 

TTAIATlLTr CIGCAGTACC GOTIGXCAC AATAIOCIUr TCAAGGSAGA 150 
IFF CST AWPQ YPL QGR 

GAAACCTGGC TIGCTGAGGG AAGTATAAAT TAIAACATCA TCTTACAGCT 200 
ETWL PEG SIN YNII LQL 

AGAancnc tceagaaagg agggcaaatc gagtgaagtg ccatatctoc 250 

DLF CRKE GKW SEV PYVQ 

AAAcrncrr ttcateaaga gacaactcac aateaigtaa aaagigiggt 300 

TFF SLR DNSQ L C K K C G 

TEAlUULJL'iA CAGGAAGODC TCAGAGTXA GCTOX'I!ACC CCAGOGTOCC 350 
LCPT GSP QSP PPYP SVP 

CTCQILLJACT (JLTlU-'llAA CTAATAAGGA OC033CTTEA AOGCAAAQOG 400 
SPT PSST N K D PPL TQTV 

TCCAAAAQGA GA!EAGACAAA G33GTAAACA ATGAAQCAAA GAGTQGCAAT 450 
QKE IDK GVNN EPK SAN 

ATiujuuiiflT TAIGODGGCT OCAAGCAGIG AGAQ3AQ3AG AAn0330X' 500 
IPRL CPL Q A V RGGE FGP 

AGOSGftGIG OTIGTACdT 'ITIL'IL'IUIL' AGACITAAAG CAAATEAAAA 550 
ARV PVPF SLS DLK QIKI 

TAGACCTAGG TAAAITCTCA GATAAOGCTC A033CEATAT TGATGTITEA. 600 
DLG KFS DNPD GYI DVL 

CAAGOGTIAG GACAAUUJ1T TGA3UIGACA TGGAGAGAIA TAAIGTEACT 650 
QGLG QSF DLT WRDI MLL 

ACEAAATOG ACACEAAG3C CAAATGAGAG AAGIG0O3CT GTAACIGGAG 700 
LNQ TLTP NER S A A V T A A 

CX3JGAGAGTT 1Q3GGATCTT ' lUU ' iaaL ' lUA GICA ULJULJA A CAATAGGATC 750 
R E F GDL WYLS Q A N N R M 

ACAACAGAQ3 AAAGAACAAC TtXCACAGGC CAGCM3CAG TTGGCAGTGT 800 
TTEE RTT PTG QQAV PSV 
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'lUlUJUL'lUl' GCTOLTXdATC CAGCAGAGQC GCCCATIQOC TCICCCAA3T 5C 
CPLC S.S STG AHCL SQL 
VRC APDP AQA PIA SPNW 
SAV LLI QHRR PLP L P I 

QGQCIAAAQG CTTQCCATIG TIDCTUCACA GCTAAGiaCC TG3GTTCATC 100 
G.R LAIV PAQ LSA WVHP 
AKG LPL FLHS .VP GFI 
GLKA CHC SCT AKCL GSS 

CTAATOGAGC TGAACACTAG TCACTGG3IT GCAGGGTTCT CTIOCATGAC 150 

NRA EH. SLGS TVL FHD 
LIEL NTS HWV PRFS SMT 
. SS TLV TGF HGS LP.P 

CCAT33CTTC TAATAGAGCT ATAACACICA CIQCA3GGTC CAAGATiaZA 200 
PWLL IEL .HS LHGP RFH 
HGF . .SY NTH CMV QDSI 
MAS NRA ITLT AWS KIP 

TTOCTIG3AA TOOJEGAGAC CAAGAAGCDC AGGTCAGAGA ACACAAQGCT 250 
SLE SVRP RTP GQR TQGL 
PWN P.D QEPQ VRE HKA 
FLGI RET KNP RSEN TRL 

TGOCACCATG TT33AAGGAG CGCaa^OCA TITTGGAAGC AGOGCGGCAC 300 

PPC WKQ PTTI LEA ARH 
CHHV GSS PPP FWKQ PAT 
ATM LEAA HHH FGS SPPL 

TATCITGGGA GCICIG3GAG CAAGGAOOCT AGGTAACAAT TIGGTGAOCA 350 
YLGS SGS KDP R.QF GDH 
I L G A L G A RTP G N N LVTT 
SWE LWE QGPQ VTI W.P 

GGAAGGGAGC TGAATCGGGA AOCATGAAG3 GATCICXZAAA GGAATIGGAA 400 
EGT I R N HEG ISK AIGN 

KGP ESA TMKG SPK QLE 
RRDL NPQ P.R DLQS NWK 
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AiUl'lLL'lLC CAAGGCAAAA AIGICCCIAA GATCTATTCT G2AGAATIQ3 450 

VPP KAK MPLR CIL ENW 
MFLP RQK CP. DVFW RIG 
CSS QGKN APK MYS GELG 

GACCAA3TTG ACCCTCAGAC AGIAAGAAAA AAATGACTTA TATICTICTG 500 
DQFD PQT VRK K.LI FFC 
TNL TLRQ E K NDL YSSA 

PI. PSD S.KKK MTY ILL 

CAGTACCGCC CIGOCCACGA TALLUL'lL'l'lL' AAG3G3GAGA AACCIG3CCT 550 
STA LATI SSS RGR NLAS 
VPP WPR YPLQ GGE TWP 
QYRP GHD ILF KGEK PGL 

CCTGAGGGAA GTDOAAATIA TAACACCATC TTACAGCTAG AOCIGTTTIG 600 

.GK YKL H H L TAR PVL 

PEGS INY NTI LQLD LFC 
LRE V.II TPS YS. TCFV 

TAGAAAAGGA GGCAAATG3A GIGAAGTGCC ATATTTACAA ACTITCITTT 650 
K R R QME .SA IFTN FLF 
RKG GKWS EVP YLQ TFFS 
EKE ANG VKCH IYK LSF 

CMTAAAAGA CAAdCQCAA TTKTGITAAC AGTGTGATTT UlUl'lOCTAC 700 
I K R Q L AI MLT V.F VFLH 
LKD NSQ LC.Q CDL CS Y 
H.KT TRN YVN SVIC VPT 

ACGGAAGCCC TCAGATTCTA CICCCCACCC CCGGCATCIC OOCIGAATCC 750 

GSP QIL LPTP GIS PES 
TEAL RFY SPP PASP LNP 
RKP SDST PHP RHL P. IP 

CTCCCCAACT TATT 764 
L P N L 
S P T Y 
P Q L I 
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TGTOOGCrGT GCTOCTGATC CAGCACAG3e GGGCATTGOC T CIXXCAATT 50 
CPLC S.S STG A H C L SQL 
VRC APDP A Q A PIA SPNW 
SAV L L I QHRR PLP LPI 

GGGCTAAAGG CTIC30CATIG TIOTIGCACA GCIAAGIGGC TGGGTTCATC 100 
G .R LAIV PAQ LSA WVHP 
AKGLPLFLHS .VPGFI 
GLKA CHC SCT AKCL GSS 

CIAATOGAGC TCAACACTAG TCACTGGGTT QCACGGTTCT CTTCCAIGAC 150 

NRA EH. SLGS TVL FHD 
LIEL NTS HWV PRFS SMT 
. SS T L V TGF HGS LP.P 

OCATGOdTC TAATAGAGCT ATAACACICA. CIGCATGSTC CAAGATTCCA 200 
PWLL IEL .HS LHGP RFH 
HGF SY NTH CMV QDSI 

MAS NRA ITLT AWS KIP 

TTDCITGGAA TCGGTGAGAC CAAGAAGQOC AGGTCAGAGA ACACAAQGCT 250 
SLE SVRP RTP GQR TQGL 
PWN P.D QEPQ VRE HKA 
FLGI RET KNP RSEN TRL 

TGOCADCATG TIGGAAGCAG GOCACCAaZA TTTIGGAAGC GGOO03CXAC 300 

PPC WKQ PTTI LEA ARH 
CHHV GSS PPP FWKR PAT 
ATM LEAA HHH FGS GPPL 

TATCITGGGA GCICIGGGAG CAAGGAOCXX CAGGTAACAA TTT33TGAOC 350 
YLGS SGS KDP QVTI W.P 
ILG ALGA RTP R.Q FGDH 
SWE LWE QGPP GNN LVT 

ACGAAGGGAC CTGAATOCI3C AAGCATGAAG GGATCTGCAA AGCAATIGGA 400 
R.RD LNPQ P.R DLQ SNWK 
EGT .IR NHEG ISK A I G 
TKGP ESA TMK GSPK QLE 
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AAliUl'lUL'lC CCAAG3CAAA AATGCCCC3A AGA1GIATIC TO3AGAATIG 450 

CSS QGK NAPK MYS GEL 
NVPP KAK MPL RCIL ENW 
MFL PRQK CP. DVF WRIG 

QC1Z\CCAATCT GACCCICAGA CAGTAAGAAA AAAAATCACT TATAT J .L T 1L' 500 
GPI. PSD SKK KNDL YSS 
DQS DPQT VRK KMT YILL 
TNL TLR Q.EK K.L IFF 

TGCAGIACCG CCIG3CCACG GATAICCTCT TCAAGGGGGA GAAAQCTQ3C 550 
AVP PGHG YPL QGG ETWP 
QYR LAT D I L F KGE KPG 
CSTA WPR ISS S RGR NLA 

CTCCTGAQ3G AAGTATAAAT TATAACACCA '1LT1ACAGCT AGACCIGTTT 600 

PEG SIN YNTI LQL DLF 
LLRE V.I ITP SYS. TC F 
S.G KYKL .HHLTARPVL 

TGTftGAAAAG GAGGCAAATG GAGTGAAGIG CCATATITAC AAACTITCTT 650 
CRKG GKW SEV PYLQ TFF 
VEK EANG VKC HIY KLSF 
.KR RQM E.SA IFT NFL 

TICATTAAAA GACAACICGC AATTAIIGTAA ACAGTGTGAT TIUIUIUCTA. 700 
SLK DNSQ LCK QCD LCPT 
H.K TTR NYVN SVI CVL 
FIKR Q L A IM. TV.F VSY 

CAGGAAQCCC TCAGATCEAC CTOCCTACCC C03CATCICC CTGACICCTT 750 

GSP QIY LPTP ASP .LL 
QEAL RST SLP RHLP DSF 
RKP SDLP PYP GIS LTPS 

CCCCAACIAA TAAGGACCCA LTIUA GCCCA AACAGTCCAA AAQGACA1AG 800 
PQLI RTH FSP NSPK GH 
PN._.GPT SAQ TVQ KDI 
PTN KDP LQPK QSK RT. 
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Q3CATIGATA. OCZAOCXZATCA GATG302AAA TCATTATTIA CIQ3A0C3O3 50 
GIDS THQ MAK SLFT GPG 
A L I APIR WPN HYL LDQA 
H.. HPS DGQI IIY WTR 

CLTITILAAA ACTATCAAQC AGATAGGGQC CGIGAAGCAT GOCAAAGAAA 100 
LFK TIKQ IGP VKH AKEI 
FSK LSS R.GP .SM PKK 
PFQN YQA DRA REAC QRN 

TAATOOCDCIG CCTEA1D3CC AlUi'lULTIU AGGAGAACAA AGAACAG3CC 150 

IPC L I A MFLQ ENK EQA 
. SPA LSP CSF RRTK NRP 
NPL PYRH VPS'GEQ RTGH 

ATIBCCCAGG GGAAGACIQ3 CAACIS^GATT T1AO0CACAT G3CCAAAIGT 200 
ITQG KTG N.I LPTW PNV 
LPR GRLA TRF YPH GQMS 
YPG EDW QLDF THM AKC 

CAQGGATITC AGCATCI30? AGTCIGGGCA GATACTITCA CIGGTIGGGT 250 
RDF SIY. SGQ ILS LVGW 
GIS AST SLGR YFH WLG 
QGFQ HLL VWA DTFT GWV 

GGAGTCTICr CCTIGTAGGA CAGAAAAGAC CCAAGAG3TA ATAAAG3CAC 300 

SLL LVG QKRP KR. .RH 
GVFS L.D RKD PRGN KGT 
ESS PCRT EKT QEV IKAL 

TAATGAAATA ATICGC A GAT TIGGA C.T1UL ' CCCAQGATm GAQ3GIGWZA 350 
. . NN SQI W TS PRIT G.Q 
NEI IPRF GLP PGL QGDN 
MK. FPD LDFP QDY RVT 
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ATGGCCCCGC TTTCAAQ3CT GCAGTAACCC AQ33ACTATC CCAQGIUI' IA 400 
WPR FQGC SNP GSI PGVR 
GPA FKA AVTQ GVS QVL 
MAPL SRL Q.P REYP R C . 

GGCATACAAT ATCACTTACA CTCIGOCIGG AGGCCACAAT CCTCCAGAAA 450 

HTI SLT LCLE ATI LQK 
GIQY HLH CAW RPQS SRK 
AYN ITYT VPG GHN PPEK 

AGICAAGAAA ATGAATGAAA CACICAAAGA TCTAAAAAAG CTAACCCAAG 500 
SQEN E.N TQR SKKA NPR 
VKK MNET LKD .LKK LTQE 
SRK M K HSKI . K S .PK 

AAACGCACAT TGCATGAGCT GTICIGTTGC CTATAACCTT ACTAAGAATC 550 
NPH CMTC SVA YNL TKNP 
T H I A.P VLLP ITL LRI 
KPTL HDL FCC L.PY . E S 

CATAACTATC CCCCAAAAAG CAG3ACTTAG CCCATACGAG ATOCTATATG 600 

. LS PKK QDLA HTR CYM 
HNYP PKS RT. PIRD AIW 
ITI PQKA GLS PYE MLYG 

GAT03JLT1T CCTAACCAAT GA CJLT1U1U C TTGACIGAGA AATG3QCAAC 650 
DGLS .PM TLC LTEK WPT 
MAF PNQ. PCA L R NGQL 

WPF LTN DLVL D.E MAN 

TTAGTTGCAG ACATCACCTC CTTAGCCAAA TATCAACAAG TICTTAAAAC 700 
• LQ TSPP .PN INK FLKH 
SCR HHL LSQI STS S.N 
L V A D ITS L A K YQQV LKT 
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ATCACAGGGA ACCTU1UX.C GAGAG3AG33 AAAQGAACEA. TIDCACGCIG 750 

HRE PVP ERRE RNY STL 
ITGN LSP RGG KGTI PPW 
SQG TCPR EEG KEL FHPG 

GTGACAIG 758 
V T 

. H 

D M 
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au moiris 70 % d'hpmologie avec respectivement les sequences (i) ou (ii), et utilisations pour detecter un retrovirus associe* &Xi sclerose en 
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MATERIEL NUCLEI QUE RETROVIRAL ET FRAGMENTS NUCLEOTIDIQUES 
NOTAMMENT ASSOCIES A LA SCLEROSE EN PLAQUES ET/OU LA 
POLYARTHRITE RHUMATOIDE, A DES FINS DE DIAGNOSTIC, 
PROPHYLACTIQUES ET THERAPEUTIQUES 
5 La^ Sclerose- en Plaques (SEP) est une maladie 

demyelinisante du systdme nerveux central (SNC) dont la 
cause complete feste encore inconnue. 

Die nombreux travaux on t e t ay e 1' hypo these d'une 
etiologie virale de A la maladie, mats: aucun des virus 
10 connus testes ne s'est avere etre 1' agent causal 
recherche: une revue des virus recherches depuis des 
annees dans la SEP a ete faite par E. Norrby et R.T. 
Johnson 

Recemment, un retrovirus, different des retrovirus 
15 humains connus a ete isole chez des patients atteints de 
SEP. Les auteurs ont aussi pu montrer que ce retrovirus 
pouvait etre transmis in vitro, que des patients atteints 
de SEP produisaient des anticorps susceptibles de 
reconnaitre des proteines associees a 1 ' infection des 
20 cellules leptomeriingees par ce retrovirus, et que 
1 ! expression . de ce dernier pouvait etre f ortement stimulee 
par Y les genes immediats-precoces de certains herpesvirus. 
I" ~ Tous ces retsultats plaident en faveur du role dans 
. la * SEP d ' au ittoins ^ uh retrovirus iricohnu ou d 1 tin virus 
25 „ ayant une o activite transcriptase inverse (RT) detectable 
, selon ia raethode publifie par H. Perron et qualifiee 
d» activite ,f RT de type LM7". 

Lea travaux de* la Demanderesse ont permis 
. ledfobtenir^ deux - lignees continues de cellules ihfect^es par 
30- des isolats naturels provenant de deux patients differents 
atteints de^SEP, par un procede de culture tel que decrit 
dans ^ le docuineht WO-A-93 20188/ dont le cbntenu est 
incorpbre: par reference: a la:; presente description. Ces 
deuxc lignees d€rivees de . cellules de plexus-chbroldes 
35/ humains, denomihees LM7PC et PLI-2 ont ete deposees a 
l^E^CvAvC^C. respect ivement le 22 juillet 1992 et r le 8 
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janvier 1993, sous les numeros 92 072201 et 93 010817, 
cpnformement aux dispositions du< Traite de Budapest. Par 
ailleurs, les isolats viraux possedant une activite RT de 
type LM7 ont egalement; ete deposes a l'E.C.A.C.C. sous la 
5 denomination globale.de, "souchesV. La "souche" ou isolat 
heberge par la lignee ^ PLI-2y ; denommee POL-2, a ete deposee 
aupres de 1 'E.G. A>.<3*:G: lev 22 juillet 1992 sous le 
n° V92072202. La "souche" ^ ou, isolat heberge par la lignee 
. LM7PC, denoimtiee MS7/PQ, a ete deposee aupres de 

10 l'E.C.A.CC. le : ;8 janvier 1993 sous le n° V93010816. 

A ; partir des- cultures ,et des: isolats precites; 
caracterises par des K criteres biologiques et 
morphologiques, on s'est ensuite attache a caracteriser le 
materiel nucleique : assoeie aux particules virales 

15- produites dans ces r ( cultures>; ; „,; 

Les portions de genome deja caracterisees ont ete 
utilisees , pour mettre au point des tests de detection 
mol^culaire du ^ /genome;; viral- - et des tests 
immunoserologiques > > rjut ilisant ties sequences d 1 amineoacides 

20 codees par les sequences ,-nucleotidiques du genome viral, 
• >«\P9ur : detecter la^ reppnse.i immunitaire dirigee. contre des 
z. epitopes ; associes a/ 1* inf ective virale. 

r:, ; ^- f : Ges ,outil*s e ^pnt q d€ ja ( -permis de confirmer une 

association (; entre la - jS EP, Ofc expression ides sequences 

25 identifiers dans les cbreyetsn cites plus loin. Cependant, 
le systeme viral ndecouVert : par la Demanderesse, 
s'apparente a un systeme retroviral complexe. En T effet, 
les sequences : xetrouv6es. encapsidees dans les particules 
virales extracellulaires^ produites par les diffferentes 

30 cultures de cellules -de ^patients atteints de;;SEP, montrent 
clairement qu'il o-y -a -eorreneapsidation de genomes 
retroviraux apparentfes / mais differents du genome 

** :„ retroviral ■"sauvagje?*^ qui produit les particules " virales 
-..-,y s infectantes. Ce phenomSne ,.> a 6te observe entre des 

35 retrovirus r£plicatifs et" des retrovirus endbgenes 
appartenant a la meme famille; voire meme heterologues. La 
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notion de retrovirus endogene est tres importante dans le 
contexte de notre decouverte car, dans le cas_de MSRV-l, 
on a observe que des sequences rStrovirales endogenes 
comprenant des sequences homoldgues au genome MSRV-l, 
5 existent dans l'ADN humain normal. L'existence d'elements 
r£troviraux endogenes (ERV) apparentes a MSRV-l par tout 
ou partie de r leur genome, explique le fait que 
1* expression du retrovirus MSRV-l dans les cellules 
humaines puisse interagir avec des sequences endogenes" 

10 proches . Ces interactions sont retrouvees dans le cas de 
retrovirus endogenes pathogenies et/ou infect ieux (par 
exemple certaines souches ecotropes du Murine Leukaemia 
virus), dians le cas de retrovirus exogenes dont la 
sequence nucleotidique peut etre retrouvee partiellement 

15 ou en totalite, sous forme d'ERVs, dans le genome de 
l 1 animal hote (ex. virus exbgene de la tumeur mammaire de 
la souris transmis par le lait) . Ces interactions 
consistent principalement en (i) une transactivatioh ou 
co-activation d'ERVs par le retrovirus replicatif, (ii) 

20 une encapsidation "illegitime" d'ARN : apparentes d'ERVS, ou 

> - d'ERVs -voire d'ARN cellulaires- possedant s implement des 
sequences d' encapsidation compatibles, dans les pairticules 
. nretrovirales produites pair 1' expression de la souche 

:i j^replicative/ parfois transraissibles et parfois avec une 

25 ; pathogenicite propre, et (iii) des recombinaisdhs plus ou 

- . t - moins importantes entre - les genomes co-encapsides , 
notamment dans les phases de transcription inverse, qui 
conduisent ■- a la formation de genomes hybrides, parfois 

*/o : transmissibles et parfois avec une pathogenicite propre. 

30 . Ainsi, (i) differentes sequences apparentees a 

MSRV-l ont ete retrouvees dans les partrcules virales 
purifi6es; (ii) l'^halyse mol£culaire des differentes 

■x. regions . du genome 1 retroviral MSRV-i doit etre faite en 
analysant systematiqu^ment les sequences co-encapsidees, 

35 inter fferantes et/ou recombiriees qui sont generees par 
1* infection et/ou 1* expression de MSRV-l, de plus, 
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certains clones . peuvent avoir des parties de sequences 
defectives produites par la replication retrovirale et les 
erreurs de matrice et/ou de transcription de la 
transcriptase inverse; (iii) les families de: sequences 
5 apparentees, a une meme region genomique retrovirale sorit 
les supports d'une detection diagnostique globale qui peut 
etre optimisee. par 1 • identification de regions invar iables 
parmi les clones exprimes et par 1 ' identification de 
trames de lectures responsables de x la production de 
10 polypeptides antigeniques et/ou pathogenes qui peuvent 
n'etre produits que par une partie, voire -.un . seul, des 
clones exprimes et dans ces conditions, 1' analyse 
systematique. des clones exprimes dans une region d'un gene 
donne permet d/evaluer la frequence de variation et/ou de 
15 recombinaison . du genome MSRV-l dans cette region et de 
definir les sequences optimales pour les applications, 
notamment. diagnostiques; (iv) la pathologie provoquee par 
un retrovirus tel que MSRV-l peut etre un ef f et direct de 
son expression et des proteines ou peptides produits de ce 
20 fait, mais aussi un effet de 1 • activation, de 
1 Vfncap^idation,; de , la recombinaison .de genomes apparentes 
9U hSterplogues et ,des proteines ou peptides produits de 
ces fait ?;,' a i n si ces genomes associes a 1' expression de 
e V ou ^ :Wl^ ection P ar MSRV-l sont-ils une partie 
2 ^ .:/ nt :® gran ^. e .. de . ■■:^ a -...Pathogenicite potent ielle de ce virus et 
- done, . constituent^, des supports de detection, diagnostique 
; et . d P s cibles therapeutiques particulieres. De meme, tout 
^ , agent , asso £f® a .» ; .. 9 U ' co-facteur de pes . interactions 
re .fP° nsables i de c<?- a : P a .thogenie en cause, tel que MSRV-2 ou 
?° le > . ; f ac ^ eur gifPt.pxique decrit dans la demande de brevet 
. p ; u, ?/ li§e : sous x f i N ^ FR-2 716 198, peut participer a 
. - . ^ 63 r afc ^ rat A on d '.une . strategie globale et tres efficace de 
d f agno£ ?£i c r qe pronostic, de suivi therapeutique et/ou de 
therapeutique integree de la SEP notamment > mais aussi de 
35 toute autre maladie associee aux memes agents. 



WO 99/02666 



PCT/FR98/01460 



Dans ce contexte, on a fait une decouverte 
parallele dans une autre maladie autoimmune, la 
polyarthrite rhumatolde (PR) , qui a ete decrite dans la 
demande de brevet frangais depbs^e sous le N°95 02960. 
5 Cette decouverte montre que, ; en appliquant des approches 
-methodologiques simi-laires a celles qui furent utilisees 
dans les travatfx de la Demanderesse sur la SEP, on a pu 
identifier: un. retrovirus exprime dans la PR qui partage 
les sequences decrites pour MSRV-1 dans la SEP et aussi, 

10 la -co-existence d'une sequence associee MSRV-2 egalement 
decrite dans la SEP. En ce qui concerhe MSRV-1, les 
sequences detectees communement dans la SEP et la PR, 
concernent les genes pol et gag. En l'etat actuel des 
connaissances, on peut associer les sequences gag et pol 

15 decrites aux souches MSRV^-1 exprimees dans ces deux 
maladies. 



:La presente demande de brevet a pour objet 
differents : resultats, supplementaires par rapport a ceux 
deja proteges par les demandes de brevet frangais : 
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N° 
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01532 
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.02. 


1994, 


publiee 


sous 


le 
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715 


937 






N° 


94 


14322 
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.11. 


1994, 


publiee 


sous 


le 


N° 
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727 


428 






N° 


94 


15810 


du 


23 


.12. 


1994 , 


publiee 


sous 


le 


N° 


2 


728 


585 



et 



30 - la demande de brevet WO-97/06260. 

La presente invention concerne tout d 1 aboird un 
materiel nucl^ique, qui peut consisted en un materiel 
; retroviral, , OS v l^tat isol6 ou purif ie, pouvant etre 
apprehende ou caracterise dei dif f erentes manieres : 
35 . ; - il comprend une sequence nucleotidique choisie 

dans le - groupe qui consiste en (i) les sequences 
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SEQ ID NO: 112, SEQ ID NO: 114, SEQ ID NO: 117, 

SEQ ID NO: 120, SEQ ID NO: 124, SEQ _ID NO: 130, 

SEQ ID NO: 141 et SEQ ID NO: 142 ; (ii) les sequences 
complementaires aux sequences (i) ; et (iii) les sequences 

5 equivalentes aux sequences (i) ou (ii), en particulier les 
sequences presentant pour toute suite de 100 monomeres 
contigus, au moins 50 %, et pref erentiellement au moins 
7 0- % d'homologie avec respectivement les sequences (i) ou 

- (ii); 

10 , - il code pour un polypeptide presentant, pour 

toute suite contigue d'au moins 30 acides amine&s, au 
moins 50 %, et de preference au moins 70 % d'homologie, 
avec une sequence peptidique choisie dans le groupe qui 
consiste en SEQ ID NO: 113, SEQ ID N°115, SEQ ID NO: 118, 
15. SEQ ID NO: 121; SEQ ID NO: 135 et SEQ ID NO: 137; 

- son gene pol comprend une sequence nucleotidique 
identique ou equivalente a une sequence choisie dans le 
groupe qui consiste en SEQ ID NO: 112, SEQ ID NO: 124 et 
leurs sequences complementaires; 
20 - l'extremite 5' de son gene pol commence au 

nucleotide 1419 de SEQ ID NO: 130; 
: ;s> " son gene pol code pour un polypeptide 

: P r 4 #sentant, pour toute suite contigue d'au moins 3 0 acides 
^minees, au moins 50 %, et de preference au moins 70 % 
25 d'homologie, avec la sequence peptidique SEQ ID NO: 113; 

. .* - l'extremite 3' de son gene gag finit au 

nucleotide 1418 de SEQ ID NO: 130; 

- r son gene eny comprend une sequence nucleotidique 

identique ou Equivalente a une sequence choisie dans le 
30 groupe qui consiste en SEQ ID NO: 117, et ses sequences 
. complementaires; 

/ - son gene env comprend une sequence nucleotidique 
qui commence au nucleotide 1 de SEQ ID NO: 117 et finit au 
nucleotide au nucleotide 233 de SEQ ID NO: . 114 ; 
35 o -son gene env code pour un polypeptide 

.presentant, pour toute suite contigue d'au moins 30 acides 
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aminees, au moins 50 %, et de preference au moins 7 0 % 
d'homologie, avec la sequence SEQ ID NO: ° 118; 

- la region U3R de son LTR 3 1 comprend une 
sequence nucleotidique qui se termine au nucleotide 617 de 

5 SEQ ID NO: 114; 

- la region RU5 de son LTR 5 1 comprend une 
sequence nucleotidique qui commence au nucleotide 755 de 
SEQ ID NO: 120 et finit au nucleotide 337 de 
SEQ ID NO: 141 ou SEQ ID NO: 142; 

10 - un materiel nucleique retroviral comprenant une 

sequence qui commence au nucleotide 755 de SEQ ID NO: 12 0 
et qui se termine au nucleotide 617 de SEQ ID NO: 114; 

- le materiel nucleique retroviral tel que def ini 
pr^cedemment est en particulier associe a au moins une 

15 maladie auto-immune telle que la sclerose en plaques ou la 
polyarthrite rhumatoide. 

L 1 invention concerne aussi un fragment 
nucleotidique qui repond au moins a'l'une des definitions 
suivantes : 

20 - il comprend ou consiste en une sequence 

nucleotidique choisie dans le grdupe qui consiste en (i) 
les sequences SEQ ID NO: 112, SEQ ID NO: 114, 

SEQ ID NO: 117, SEQ ID NO: 120," SEQ ID NO: 124, 

SEQ ID NO: 130, SEQ ID NO: 141 et SEQ ID NO: 142 ; (ii) 

25 les sequences complementaires des sequences (i) ; et (iii) 
les sequences equiyalentes aux sequences (i) ou (ii) , en 
particulier les sequences presentant pour toute suite de 
100 monbmerers coritigus, au moins 50 %, et 
preferentiellement au moins 70 % d» homologies « avec 

30 respect ivement les sequences (i) ou (ii) ; 

- il comprend ou consiste en une sequence 
nucleotidique codant pour un polypeptide presentant, pour 
tbute suite contigue d*au mcJinis 30 acides aminees, au 

■ moins 50 %, et de preference au moins 70 % d^omologie, 
35 avec une sequence peptidiquie choisie dans le groupe qui 
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consiste en SEQ-ID NO: 113, SEQ ID N°115, SEQ ID NO: 118, 
SEQ ID NO: 121, SEQ ID NO: 135 et SEQ ID NO: 137. 

D 1 autres objets de la presente invention sont les 
suivants : 

5 - une sonde nucleique pour la detectipn d'un 

retrovirus associe a la sclerose en plaques et/ou la 
polyarthrite rhumatoide, susceptible de s'hybrider 
specif iquement sur tout fragment precedemment defini et 
appartenant au genome dudit retrovirus; elle possede 
10 avantageusement de 10 a 100 nucleotides, de preference de 
10 a 30 nucleotides; 

- une amorce pour 1 1 amplification par 
polymerisation d'un ARR ou d'un ADN d'un retrovirus 
associe a la sclerose en plaques et/ou la polyarthrite 

15 rhumatoide, qui comprend une sequence nucleotidique 
identique ou equivalente a au moins une partie de la 
sequence nucleotidique d'un fragment defini precedemment, 
notamment une sequence nucleotidique presentant pour toute 
suite de 10 monomeres contigus, au moins 50 %, de 

20 preference au moins 70 % d'homolqgie avec au moins ladite 
partie dudit fragment ; de preference la sequence 
nucleotidique d'une amorce de 1* invention est choisie 
parmi SEQ ID NO: 116, SEQ ID NO: 119, SEQ ID NO: 122, 
SEQ ID NO: 123, SEQ ID NO: 126, SEQ ID NO: 127, 

25 SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 132, et 

SEQ ID NO: 133; . 

- un ARN ou ADN, et notamment vecteur de 
replication et/ou d* expression, comprenant un fragment 
genomique du materiel nucleique ou un fragment defini 

30 precedemment; ' 

un peptide code par tout cadre ,de lecture ouvert 
appartenant a un fragment nucleotidique . defini 
precedemment, notamment un polypeptide, par exemple 
oligopeptide formant ou comprenant un determinant 

35 antigenique reconnu par des sera de patients infectes par 
le virus MSRV-1, et/ou chez lesquels le virus MSRV-1 a ete 
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reactive; un peptide preferentiel comprend une sequence 
identique, partiellement ou totalement, ou equivalente a 
une sequence choisie parmi SEQ ID NO: 113, SEQ ID N°115, 
SEQ ID NO: 118, :SEQ ID NO: 121,. SEQ ID NO: 135 et 

5 SEQ ID NO: K*7; 

- une composition diagnostique, prophylactique, ou 
therapeutique, *notamment pour inhiber l'expression d'au 
moins un; retrovirus associe a la sclerose en plaques- et/ou 
a la polyarthrite rhumatoide, comprenant un fragment 

10 nucleotidique defini precedemment ; 

- un procede pour detecter un retrovirus associe a 
la sclerose en plaques et/ou a la polyarthrite rhumatoide, 
dans un echantillon biologique, comprenant les etapes 
consistant a mettre en contact un ARN et/ou un ADN presume 

15 appartenir ou provenant dudit retrovirus, ou leur ARN 
et/ou ADN complementaire, avec une composition comprenant 
un fragment nucleotidique defini ci-dessus. 

Avant.de- detainer. 1 1 invention,, differents termes 
utilises dans la description et les revendications sont a 

20 present def inis : 

- par souche pu isolat, on. entend. toute fraction 
biologique infectante et/ou pathogene, contenaint par 
exemple des virus et/ou; des bacteries et/ou des parasites, 
generality un pouvoir pathogene et/ou- antigenique, hebergee 

25 par une culture ou un hote vivant ; a titre d. 1 exemple, une 
v souche yirale selon la definition precedente peut contenir 
un agent co-inf ectant , par exemple un protiste pathogene, 

- le^ terme "MSRV", utilise dans la presente 
; description designe tout agent pathogene et/ou* infect ant, 

30 associ<§ a la, SEP, notamment une, espece virale, les souches 
attenuees \ de ladite espece viraley . ou les particules 
defectives interferences ou contenant : des genomes co- 
encapsides ou- encore, des . genomes recombines avec une 
partie du genome MSRV-1, derivees de; cette espece. II est 

3,5 connu que les* virus et particuliereroent les virus 
contenant de I'ARN ont une variability, consecutive 
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notamment a des taux relativement eleves de mutation 
spontanee, dont il sera term compte ci-apres pour definir 
la notion d • equivalence, 

r par virus humain, on entend un' virus susceptible 
5 d f infecter ou d'etre heberge par 1 1 etre Kumain, 

- compte tenu de toutes les variations et/ou 
recombinaison haturelles ou induites, pduvant etre 

; rencontrees dans la pratique de la presente invention, les 
objets de cette .derniere, definis ci-dessus et dans les 
10 revendications, ont ete exprimes en comprenant les 
equivalents ou derives des dif f erents materiels 
biologiques definis ci-apres, notamment des sequences 
homologues nucleotidiques ou peptidiques, 

- le variant d^un virus ou d'un agent pathogene 
15 et/ou infectant selon 1 ' invention, comprend au moins un 

antigene reconnu par au moins un anticorps dirige contre 
au moins un antigene correspondant dudit virus et/ou dudit 
_ agent pathogene et/ou infectant, et/ou un genome dont 
toute . partie est detectee> par au moins line sonde 

20 d 1 hybridation, et/ou au moins une amorce d 1 amplification 
nucleotidique- specif ique dudit virus eit/ou agent pathogene 
et/ou infectant, dans des conditions d 'hybridation 
determinees bien connues de i'hommie de l r art, 
: r - r- selon 1* invention, un fragment nucleotidique ou 

25 ,un oligonucleotide ou un polynucleotide est un 
renchalnement de monomeres, ou un biopolymere^ caracterise 
par la sequence inf ormationnelle des acide£ nucleiques 
naturels , susceptible de s 1 hybrider a r tout autre fragment 
nucleotidique dans des conditions predetermines, 

30 1 lenchainement pouvant contenir des monomeres de 

. structures chimiquesi dif f erentes " et gtre obtenu a partir 
d*une molecule -d'acide nucleique naturelle et/ou par 
recombinaison genet ique et/ou par' synthese chimique ; un 
fragment nucleotidique peut etre identique a un fragment 

35 genomique du virus MSRV-1 considered par la presente 
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invention, notamment un gene de ce dernier, par exemple 
pol ou env dans le cas dudit virus ; 

- ainsi un monomere peut etre un nucleotide 
naturel d'acide nucleique, dpnt les elements constitutifs 

5 sont un sucre, un groupement phosphate et une base azotee; 
dans l'ARN le sucre est le ribose, dans l^ADN le sucre est 
le desoxy-2-ribose; selon qu'-il. s.'agit de I'ADN ou 1 • ARN, 
la base azotee est choisie parmi I'adenine, la guanine, 
I'uracile, la cytosine, la thymine; ou.-'Ie: nucleotide peut 

10 etre modifie dans I'un au moins des trois elements 
constitutifs ; a titre d'exemple, la modification peut 
intervenir au niveau des bases, generant des bases 
modifiees telles que l'inosine, la methyl-5- 
desoxycytidine, la desoxyuridine , la 

15 dimethylamineo-5-desoxyuridine, la diamineo-2 , 6-purine, la 
bromo-5-desoxyuridine et toute autre base modifiee 
favorisant 1" hybridation; au niveau du sucre, la 
modification peut consister dans: le -remplacemeht d'au 
moins un desoxyribose par un polyamide, et au niveau du 

20 groupement phosphate, la modification peut consister dans 
son remplacement par des esters, notamment choisis parmi 
les esters de diphosphate, d'alkyl et arylphosphonate et 
de phosphorothioate, . . 

par "sequence irif ormationnelle" , on . s entend toute 

25 suite ordonnee de monomdres> dont la nature chimique et 
1 •ordre dans un sens de reference, constituent ou non une 
information f onctionnelle de meme qualite que celle des 
. . acides nucleiques naturels , 

- par: hybridation, on entend le processus au cours 
30 duquel, . dans des ^conditions operatoires appropriees, deux 

fragments nucleotidiques, ayant des sequences suffisamment 
complementaires, s' apparient pour former une structure 
complexe, notamment double ou /triple, de preference sous 
forme d'helice, . 
35 -une sonde comprend un fragment nucl^otidique 

isynthetise par voie chimique. ou obtenu par digestion ou 
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coupure enzymatique d'un fragment nucleotidique plus long, 
comprenant au moins six monomeres, avantageusement de 10 a 
100 monomeres, de preference 10 a 30 monomeres, et 
possedant une specif icite d 'hybridation dans des 
5 conditions determinees ; de preference, une sonde 
possedant moins de 10 monomeres n'est pas utilisee seule, 
mais l'est en ^presence d • autres sondes de taille aussi 
courte ou non ; dans certaines conditions particulieres, 
il peut etre utile d'utiliser des sondes de taille 
10 superieure a 100 monomeres ; une sonde peut ndtamment etre 
utilisee a des fins de diagnostic et il s'agira par. 
exemple de sondes de capture et/ou de detection, 

- la sonde de capture peut etre immobilisee sur un 
support solide par tout moyen approprie, c'est-a-dire 

15 directement ou indirectement , par exemple par covalence ou 
adsorption passive, 

- la sonde de detection peut etre marquee au moyen 
d'un marqueur choisi notamment parmi les isotopes 
radioactifs, des enzymes notamment choisis parmi la 

20 peroxydase et la phosphatase alcaline et ceux susceptibles 
d'hydrolyser un substrat chromogene, fluorigene ou 
luminescent, des composes chimiques chromophores, des 
composes chromogenes, fluorigenes ou luminescents, des 

r analogues de : bases nucleotidiques, et la^biotine, 

25 - les sondes utilisees a des fins de diagnostic de 

1 1 invention peuvent etre mises en oeuvre dans toutes les 
techniques d "hybridation connues, et notamment les 
techniques dites "DOT-BLOT", SOUTHERN BLOT", "NORTHERN 
BLOT" qui est une technique identique : a la technique 

30 "SOUTHERN BLOT" mais qui utilise de l'ARN comme icible, la 
technique SANDWICH ; avantageusement, on utilise la 
.technique SANDWICH dans la presente invention, comprenant 
une sonde de capture: specif ique et/ou une sonde de 
detection specif ique, etant entendu que la sonde de 

35 capture et la sonde de detection doivent* presenter une 
sequence nucleotidique au moins partiellement dif f erente, 
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- toute sonde selon la presente invention peut 
s f hybrider in vivo ou in vitro sur l'ARN et/ou sur l'ADN, 
pour bloquer les phenomenes de replication, notamment 
traduction et/ou transcription, et/ou pour degrader ledit 

5 ADN et/ou ARN, 

- une amorce est une sonde comprenant au moins six 
monomeres, et 1 avantageusement de 10 a 3 0 monomeres , 
possedant une specif icite d 'hybridation dans des 
conditions determinees, pour 1 1 initiation d'une 

10 polymerisation enzymatiquev par exemple dans une technique 
d' amplification telle que la PCR (Polymerase Chain 
Reaction), dans un procede d' elongation, tel que le 
sequen<page,~ dans, une methode de transcription inverse ou 
analogue, 

15 - deux sequences nucleotidiques ou peptidiques 

sont dites equivalentes ou derivees l'une par rapport a 
1" autre, ou par rapport a une sequence de reference, si 
f onctionnellement les biopolymeres correspondants peuvent 
jouer sensiblement le meme role, sans etre identiques, 

20 vis-a-vis de 1' application ou utilisation consideree, ou 
dans la technique dans laquelle : elles interviennent ; sont 
notamment equivalentes deux sequences obtenues du fait de 
la variabilite naturelle, notamment mutation spontanee de 
!• espece* a partir de laquelle elles ont 6te identifiees, 

25 ou induite, ainsi que deux sequences homologues, 
l'homologie etant d#finie ci-apres, 

- par "variabilite" , on entend toute modif ication, 
spontanee ou induite d'une sequence, notamment par 
substitution, et/ou insertion, et/ou deletion de 

30 nucleotides : et/ou de , fragments nucleotidiques, et/ou 
extension et/ou racourcissement de la sequence a l'une au 
moins des extr6mites; une variabilite non naturelle peut 
resulter des techniques de genie .gen§tique utilisees, par 
exemple du - choix des amorces de synthese deg€n§r6es ou 

35 non, retenues pour amplifier un acide nucleique; cette 
variability peut se traduire par des modifications de 
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toute sequence de depart, consideree comme reference, et 
pouvant etre exprimees par un degre d'homologie par 
rapport a ladite sequence de reference, 

- 1 '.homologie caracterise le degre d'identite de 
5 deux fragments nucleotidiques ou peptidiques compares ; 
elle se mesure par le pourcentage d'identite qui est 
notamment determine par comparaison directe de sequences 
nucleoditiques ou peptidiques, par rapport a des sequences 
nucleotidiques ou peptidiques de reference, 

10 - tout fragment nucleotidique est dit equivalent 

ou derive d'un fragment de reference, s'il presente une 
sequence nucleotidique equivalente a la sequence du 
fragment de reference ; selon la definition precedente, 
sont notamment equivalents a un fragment nucleotidique de 

15 reference-: 

(a) tout fragment susceptible de s'hybrider au 
moins partiellement avec le complement du fragment de 
reference, 

(b) tout fragment dont I'alignement avec le 
20 fragment.de reference conduit a mettre en Evidence des 

bases contigues identiques, en . nombre plus important 
qu'avec.tout autre fragment provenant d'un autre groupe 
taxonomique, ;r r u v 

J } :i "(c) tout fragment resultant ou pouvant resulter de 
25 ; ; la variability naturelle de l'espece, a partir de laquelle 
il est obtenu, 

(d) tout fragment pouvant resulter des techniques 
de genie genetique appliquees au fragment de reference, 

(e) tout fragment, compbrtant au moins huit 
30 nucleotides^ contigus, codant pour un peptide homologue ou 

identique au peptide code par ;le fragment de reference, 

r. » , (f ) tout; fragment dif ferent, w du fragment de 
reference, par insert ion, -r. deletion, : substitution d f au 
moins un monoinere, extension, ou raccourcissement a l'une 
35 au moins de ses extremites ; par ekiemple, tout fragment 
corresporidant au fragment de reference, flanque a l'une au 
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moins de ses extremites par une sequence nucleotidique ne 
codant pas pour un polypeptide, 

- par polypeptide, on entend notamment tout 
peptide d'au moins, deux. acides aminees, notamment 

5 oligopeptide, proteine, extrait, separe, ou 

substantial lement isole ou synthetase, „ . par 1' intervention 
de la main de 1 ?1 homme, notamment ceux obtenus par synthese 
chimique, ou par expression dans un organisme recombinant, 

- par polypeptide code.de maniere partielle par un 
10 fragment:? nucleotidique, on entend un polypeptide 

presentant au moins trois acides aminees codes par au 
moins neuf monomeres contigus compris dans ledit fragment 
nucleotidique, 

- un acide aminee est dit analogue a un* autre 
15 acide aminee, lorsque leur caracteristiques physico- 

chimiques respectives, telles que polarite, 

hydrophobicite, et/bu basicite, ;et/ou acidite, et/ou 
neutrality, sont sensiblement les memes ; ~ ainsi, une 
leucine est analogue a une isoleucine. , * 

20. - tout polypeptide est dit equivalent ou derive 

d'umr .polypeptide de reference, si ^ les polypeptides 
compares ont sensiblement les memes^ proprietes, et 
notamment les . v memes proprietes antigeniques, 

.animunologiques,* enzymologiques et/ou jde reconnaissance 

25 , , moleculaire ; est notamment equivalent a un polypeptide de 
reference : 

-(a) tout polypeptide possedant une sequence dont 
au^ moins un acide, aminee a ete substitue par un acide 
aminee analogue, +■ . 

30, (b) tout polypeptide ayant une ^sequence peptidique 

equivalente, obtenue par variation naturelle ou induite 
dudit polypeptide de reference, et/ou- du fragment 
-e , nucleotidique codant pour ledit. polypeptide, r 

■(c) -.un mimotope dudit polypeptide de reference, 
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(d) tout polypeptide dans la sequence duquel un ou 
plusieurs acides aminees de la serie L sont remplaces par 
un acide aminee de la serie D, et vice versa, 

(e) tout polypeptide dans la sequence duquel on a 
5 introduit une modification des chaines laterales des 

acides aminees, telle que par exemple une acetylation des 
fonctions aminees, une carboxylation des fon'ctions - thiol, 
une esterif ication des fonctions carboxyliques, 

(f) tout polypeptide- dans la sequence duquel une 
10 ■'■ .-ou des liaisons peptidiques ont ete modifiees, comme par 

exemple les liaisons carba, retro, inverso, - retro-ihverso, 
reduites, et methylene-oxy y 

(g) tout polypeptide dont au moihs uh ahtigene est 
■ reconnu par anticorps dirige contre un polypeptide de 

15 reference, - 

- le pourcentage d'identite caracterisant 
l'homologde de deux fragments peptidiques compares est 
selon la presente invention d'au moins : 50% et de 
pr6f£rence au moins i 70 %. 
20 Etant donne qu 'un virus possedant une activite 

> enzymatique transcriptase inverse peut etre genStiquemeht 
caracterise aussi* bien sous forme d 'ARN que d 'ADN, ■ il sera 
^-fait mention aussi bien de 1 * ADN que de 1 'ARN viral pour 
caracteriser les sequences relatives a un Virus possedant 
25 une telle activite transcriptase inverse, dit MSRVrl selon 
la presente description. 

Les expressions d'ordre utilisees dans la presente 
description et les rfevendications; telles que "premiere 
sequence nucleotidique" ne sont pas retenues pour exprimer 
30 un ordre parti culler, mais pour definir plus clairement 
1' invention. ; • v- ' *" 

Par detection d 'line substance ou agent, on entend 
ci-apres aussi- bien —une " identification,- qu^ne 
quantification, ou une separation ou isolement de ladite 
35 substance ou dudit agent. 
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L' invention sera mieux comprise a la lecture de la 
description detaillee qui va suivre faite en reference aux 
figures annexees dans lesquelles : 

La Figure 1 represente la structure generale de 
5 1 1 ADN proviral et 1 1 ARN genbmique de MSRV-1 . 

La Figure 2 r*epr&sente la sequence nucleotidique 
du clone denomifte* CL6-5 1 (SEQ ID NO: 112) et trois trames 
de lecture potentielles en acides aminees figurant- sous la 
sequence nucleotidique. 
10 La Figure 3 represente la sequence nucleotidique 

du clone denommS CL6-3 ' (SEQ ID NO: 114) et trois trames 
de lecture potentielles en acides aminees figurant sous la 
sequence nucleotidique. - 

La Figure 4 represente la sequence nucleotidique 
15 du clone denomme C15 (SEQ ID NO: 117) et trois trames de 
lecture potentielles en acides aminees figurant sous la 
sequence nucleotidique. 

La Figure 5 represente la sequence nucleotidique 
du clone denomme 5M6 (SEQ* ID NO: 120) et trois trames de 
20 lecture potentielles en acides aminees figurant sous la 
1 sequence nucleotidique. ^ : 

La Figure 6 represente la sequence nucleotidique 
du clone denomme CL2 (SEQ ID NO: 130) et trois trames de 
lecture jpbtentielles en acides aminees figurant sous la 
25 sequence nucleotidique. 

La Figure 7 represente trois trames de lecture 
' ' potentielles en acides aminees exprimees par pET28C-clone 
2 et figurant sous la sequence nucleotidique. 

La Figure 8 represente trois trames de lecture 
30 potentielles en -acides aminees exprimees par pET21C-rclone 
2 et. figurant sous la sequence nucleotidique. 

La Figure 9- represente la sequence nucleotidique 
du clone denomme -LB13 (SEQ ID NO: 141) et trois trames de 
lecture potentielles en acides amines figurant sous la 
35 sequence nucleotidique. 
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La Figure 10 represente la sequence nucleotidique 
du clone denomme LA15 (SEQ ID NO: 142) et trois. trames de 
lecture potentielles en acides aminees. figurant sous la 
sequence nucleotidique. 
- 5 La Figure 11 represente la sequence, nucleotidique 

du clone denomme LB16 (SEQ ID NO: 124 )et trois trames de 
lecture potentielles en acides aminees figurant sous la 
sequence nucleotidique > 

La Figure 12 represente I'activite promotrice 

10 exprimee en cpm/4min des sequences U3R sous donees: a 
partir de LTRs d'origines differentes dans le plasmide 
PCAT3 . PCAT3 signifie plasmide seul , PCAT-PH74 signifie 
plasmide plus clone U3Rendogene exprime dans le placenta, 
PCAT-cl6 signifie plasmide plus clone U3R amplifie dans 

15 1 • ARN d»un plasma SEP, PCATfr5M6 ; signifie plasmide plus 
region U3R amplifiee dans l'ADN cellulair,e, "no plasmid" 
signifie absence de plasmide dans; le; test; , 

La Figure 13 represente les sequences MSRV1 env et 
LTR3 ' . Les f leches horizontales indiquent le debut des 

20 . regions env, U3 et R. Dans la region env : le peptide 
signal et la region immunosuppressive potentielle sont 
soulignes, les sites de. —glycosilatipn potentiels sont 
encadr.es: et les sites de clivage potentiels sont indiques 
par des f leches verticales. Dans la region ; £ U3R :r l ' element 

25 de regulation CAAT et la TATA Box sont soulignes, le site 
"cap" et le signal de polyadenylation\ sont aussi indiques. 

La Figure 14 represente une region LTR5 1 (RU5) 
suivie d Vun site PBS (primer binding site) complementaire 
. r du tARN Trp etd'un gene gag eodant pour une proteine 

30 d 1 environ 487 acides: amines . Les acides amines *conserv6s 
dans la nucleocapside sont soulignes .deux f pis. Les acides 
amines definissant la region de plus forte, homologie dans 
la capside sont en gras et soulignes une f pis. Les /. dans 
_la. sequence eh acides amines indiquent des variations 

35 observ6es selon les clones et dans la sequence 
nucleotidique ils indiquent des sauts de trame dans 
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certains clones. Les regions encadrees correspondent a des 
epitopes identifies par analyse peptidique de la region C- 
terminale. 

La Figure 15 represente la region integrase de 
5 MSRV1, la sequence nucleotidique et la sequence en acides 
amines deduite de - la— region integrase correspondant au 
clone 87-23. Dahs la Figure 15 // signifie un saut de 
trame qui a ete supprime pour restituer l'ORF potentiel. 
Les lettres en caracteres gras soulignees representerit les 
10 acides amines conserves des integrases retrovirales . 

La Figure 16 decrit les sequences nucleotidique et 
peptidique du clone B13 (identique au clone FBdl3 decrit 
dans des demandes anterieures) avec indication des ORFs et 
des codons stop representes par un point. La region 
15 soulignee en gras represente le domaine immunosuppresseur 
potentiel. Le domaine souligne en simple represente le 
■--7 debut du LTR3 1 . 

EXEMPLE 1 : OBTENTION D'UNE REGION CL6-5 • CODANT POUR 
20 L'EXTREMITE* . N-TERMINALE DE L' INTEGRASE ET- D'UNE REGION 
CL6^3 1 CONTENANT LA SEQUENCE 3 ' TERMINALE DU GENOME MSRV-1 

^ - v Une 3 f 'RACE a ete effectuee sur de I'ARN total 

extrait de plasma d'un patient atteint de SEP. Un plasma 

25 temoin sain> traite sous les memes condition^, a et§ 
1 utilise comme controle negatif. La synthese de cDNA a ete 

£ realisee avec une amorce oligo dT identifi&e par 
' SEQ. ID NO: 68 (5'-GAC TCG CTG -CAG ATC GAT TTT TTT TTT TTT 

^ TTT. jTo 3')r-- et; la transcriptase inverse "Expand™ RT" de 

30 . Boehringer selon les conditions preconisees par la 
societe. Une PGR a ete effectuee avec 1' enzyme Klentaq 
(Clontech^ sous' les conditions suivantes : 94 °C 5 min puis 
93°C: 1 min/ 58°C l^min, 68°G^ 3 min pendant 40 cycles et 
68°C pendant 8 min, *avec un volume reactionnel final de 

35 50 Ml. 

Amorces utilisees pour la PCR: > 



WO 99/02666 



PCI7FR98/01460 



20 

- amorce 5\ identifiee par SEQ ID NO: 69. 

5 • GGC ATC AAG CCA CCC AAG AAC TCT TAA CTT 3 ' ; 

- amorce 3 1 , identifiee par SEQ ID NO: 68 

Une deuxxeme PCR dite "semi-nichee" a ete 
5 realisee avec une amorce 5 1 situee a I'interieur de la 
region deja amplifiee* Cette . deuxieme PCR a ete effectufee 
sous les meme^ conditions experimentales que celles 
utilisees lors de la premiere PCR, en utilisant 10 pi du 
produit .d' amplification issu de la premiere PCR. 
10 Amorces utilisees pour la PCR semi-nichee: 

- amorce 5 1 , identifiee par SEQ : ID NO: 70 

5.' CCA ATA GCC AGA CCA TT A TAT ACA CTA ATT 3 1 ; 

- amorce 3 1 , identifiee par SEQ ID NO: 68 

Les amorces SEQ ID .NO: 69- , et SEQ ID NO: 7 0 sont 

15 specif iques de la region pol de; MSRV-1 . 

Un produit d ■ amplification de 1,9Kb a ete obtenu 
pour le plasma de patient SEP. Le fragment correspondant 
n r a pas ete observe pour le plasma temoin sain. Ce produit 
d 1 amplification a ete clone de la fagon suivante : 

20 1 1 ADN amplifie a. ete insere dans un plasmide a l'aide du 
kit TA Cloning®. Les 2 /xl de solution d'ADN ont ete 
melanges avec 5 /xl d'eau distillee sterile, l/xl d'un 
tampon de ligation 10.. fois concentre "10X LIGATION 
- BUFFER", 2 /xl de "peR^ M VECTOR"; (25 ng/ml), et 1 jxl de "T4 

25 DNA LIGASE". Ce melange a ete incube une nuit a 12°C. Les 
etapes suivantes ; - ont ete i . rrealisees conformement aux 
instructions du kit rTA Cloning® (Invitrogen) . A la fin de 
la procedure, les colonies blanches de bacteries 
recombinanfces .(white) ont t ete repiquees pour etre 

30 cultivees . et permettre - . JL ■ extraction des plasmides 
incorpores selon la .... procedure? dite de "miniprep" . La 
preparation de plasmide de; chaque cplonie; recombinante a 
<§te* coupee par une enzyme , de restriction appropriee et 
analysee sur gel dVagarose. Les plasmides possedant un 

35 insert detecte sous lumiere UV apres marquage du gel au 
bromure d*§thidium, ont 6te selectionnes pour le 
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sequengage- de 1» insert, apres hybridation avec une amorce 
complementaire du promoteur Sp6 present sur le plasmide de 
clonage du TA cloning kit®. La reaction prealable au 
sequengage a ensuite ete effeetuee selon la methode 
5 preconisee pour 1 •utilisation du kit de sequengage 
"PRISM™ Ready Reaction AmpliTaq® FS, DyeDeoxyTM 
Terminator" (Applied Biosystems, 7'ref. 402119) et le 
sequengage automatique a ete realise sur les appiareils 
373 A et 377 Applied Biosystems, selon les instructions du 

10 fabricant. 

Le clone obtenu, contient une region CL6-5'codant 
pour l'extremite N- terminale de l'integrase et une region 
CL6-T3 1 , correspondant a la region 3' terminale de MSRV-1 
et permettant .de definir. la f in. de 1* enveloppe (234 pb) et 

15 les regions U3 , R (401 pb) du retrovirus MSRV1. 

La region correspondant a l'extremite N terminale 
de l'integrase est representee par- sa sequence 
nucleotidique (SEQ ID NO: 112) dans la figure 27. Les 
trois trames de lecture potentielles sont presentees par 

20 leur sequence amineoacide sous la sequence nucleotidique, 
et la sequence amineoacide de 1' extremite N-terminale de 
l'integrase est identifiee par SEQID NO: 113. 

La region C16-3 ' est representee par sa sequence 
nucleotidique (SEQ ID NO: 114) dans la figure 3. Les trois 

25 trames de ;\ lecture- potentielles sont;' presentees par leur 
sequence amineoacide sous la sequence: nucleotidique. Une 
sequence amineoacide correspondant a l'extremite 
— C-terminale de la; proteine env de MSRV^l est identifiee 
par SEQ ID -NO:. 115 . ,v- - 

30 Afin d'evaluer I'activite promotrice du LTR obtenu 

a partir de clone 6 (cl6) un test d'activite promotrice 
utilisant 1 1 enzyme : - CAT (chloramphenicol acetyl 
transferase) a ete effectue avec la region U3R 
correspondante . En parallele un clone cpntenant la meme 

35 region U3R d'ARN retroviral endogene exprime dans le 
placenta normal (PH74) et un clone (5M6) provenant d • ADN 
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ont ete testes. Le resultat presente dans la figure 12 
montre une tres „ forte aetivite promotrice du LTR issu de 
plasma SEP (cl6) et une aetivite signif icativement 
beaucoup : plus faible ayec les sequences d ' origine endogene 
5 non SEP. 

EXEMPLE 2 ; OBTEfjTION DU CLONE C15 CONTENANT LA REGION 
GOD ANT POUR UNE ; PART IE DE: L • ENVELOPPE DU RETROVIRUS- MSRV-1 

10 Une RT-PCR a ete effectuee suir der ~ l ' ARN total 

extrait dei . virions concentres par ultracentrif ugation a 
partir d'un surnageant . de culture de synoviocytes 
provenant : d'un patient PR. La synthese de cDNA a ete 
rSalisee avec une amorce bligo dT et la transcriptase 

15 inverse ''Expand™ RT ,, l de Boehringer selon les conditions 
preconisees ; par Ta societe. Une PCR a ete effectuee avec 
.1* Expand™ Long Template . PGR System (Boehringer) sous les 
conditions suivantes : 94 °C 5 min puis 93 °G 1 min, 60°C 1 
min; 68°;G, 3 min pendant 4 0 cycles et 68°C pendant 8 min et 

20 avec un. volume treactionnel final de 50 /xl. 
Amorces utilisees- pour la> PCR: 

- amorce 5 f , idehtilfi&e par SEQ ID NO: 69 

5 ■•; :GCG: ATC AAG ;GGA- CGC: AAG AAC TGT TAA CTT 3' ; 
.-;.,amorce «3 -.Up ideritif iee par r SEQ ID r NO: 116 ', 
2 5 5 1 TGG GGT TGGe ATT t TGT AAG ACC ATC TGT AGC TT 3* 

oUhe rr> deuxieme- PCR date "semi-nichee" a ete 
,realisee avec une amorce 5' situee a 1 * interieur de la 
:; region deja amplif iee.v Cette deuxieme PCR a ete effectuee 
sous les memes conditions experimentales que celles 
30. ;ut£lis6es-"-l6rs^rd^^la>.prem±dre. PCR (sauf que 30 cycles oht 

; §te :r6al is§s< au 1 ieu de .4 0 ) ,< en utilisant 10 /xl du produit 

d 1 amplification issu de la premiere PCR. 
Amorces utiliseesapour la -PCR semi^nichee: 

- amorce 5 1 identif i§e par SEQ ID NO: 70 

35 5' CCA ATA GCC AGA CCA TTA TAT ACA CTA ATT 3' ; 

- amorce 3 f idehtifi^e par SEQ ID NO: 116 \ 
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Les amorces SEQ ID NO: 69 et SEQ ID NO: 7 0 sont 
specif iques de la region pol de MSRV-1. L 1 amorce SEQ ID 
NO: 116 est specifique de la sequence FBdl3 (aussi denomme 
B13) et est localisee dans la region env conservee parmi 
5 les oncoretrovirus. 

Un produit d 'amplification die 1932 pb - a ete 
obtenu et clone ? de la fa<?on suivante : 

1 1 ADN amplifie a ete insere dans un plasihide a l-'-aide du 
kit TA Cloning®. Les differentes etapes ont ete realisees 

10 conformement aux instructions du kit TA Cloning® 
(Invitrogen) . A la fin de la procedure, les colonies 
blanches de bacteries recombinantes (white) ont ete 
repiquees pour etre cultivees et permettre 1' extraction 
des plasmides. incorpores. selon la procedure dite de 

15 Vininiprep" , La preparation de plasmide de ehaque coloriie 
recombinante a ete coupee par une enzyme de restriction 
, appropriee et analyses sur gel d* agarose; Les plasmides 
possedant un insert detecte sous lumiere UV apre^ marquage 
du gel au bromure d 1 ethidium, ont ete selectionries pour le 

20 sequengage de l 1 insert, apres hybridation avec urie amorce 
complementaire du promoteur SP6 present sur le plasmide de 
clonage du TA cloning kit®. La reaction prealable au 
sequengage a ensuite ete effectuee selon la methode 
preconiseeV pour 1 •utilisation du ; kit de sequengage 

25 "PRISM™ Ready Reaction AmpliTaq® FS, DyeDeoxy™ 
Terminator" (Applied Biosystems, ref. 402119) et le 
sequengage automatique a 6te realise sur les appareils 
373- A et 377 Applied Biosystems, selon les instructions du 
fabricant. ^ --• ::r ■ 

30 Le clone C15 obtenu, contient une region 

; : correspondant a la region : "de 1 'enveloppe de MSRV-1, de 
1481 pbV 

La region env du clone CIS est representee par sa 
sequence nucleotidique (SEQ ID NO: 117) dans la figure 5. 
35 Les trois trames de lecture potentlelles de ce clone sont 
presentees par leiir sequence amineoacide sous la sequence 
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nucleotidique. La trame de lecture correspondant a une 
proteine env structurale MSRV-l est identifiee par 
SEQ ID NO: 118. 

^ r A partir des sequences definies. provenant des 

5 clones cl6 et C15, une construction plasmidique a- pu etre 
effectuee codant pour une enveloppe complete suivie du 
LTR3 1 , comme pr^sente dans la figure 13: avea la trame de 
lecture correspondante. 

10 EXEMPLE 3 : (DETENTION D • UN CLONE 5M6 CONTENANT LES 
.- SEQUENCES DE LA REGION 3 ■ TERMINALE DE L 1 ENVELOPPE , 
SUIVIES DES SEQUENCES U3,R,U5 DE TYPE PROVIRAL MSRV-l . 

Une PCR monodirectionnelle a ete effectuee sur de 
15 1 • ADN extrait de lymphocytes B immortalises en culture 
d r un .patient PR . La PCR a ete effectuee avec 1* Expand™ 
Long Template PCR, System (Boehringer) sous les conditions 
suivantes : 94°C 3 min puis 93°C 1 min, 60°C 1 min, 68°C 3 
min pendant 10 cycles , puis 93°C l min, 60°C 1 min avec 
20 15 .sec^d'extensipn a chaque cycle, 68 °C 3 min pendant 35 
cycles et 68?C pendant 7 min ^et avec un volume reactionnel 
final. de 50 #1. ; . ~ 

-r Vj L 1 amorce utilisee pour la PCR identifiee par 
r r : SEQ;ip NO: 11 9 , est 5* TCA AAA TGG AAG AGC TTT AGA :CTT GCT 
25 AAC CG 3 1 r - 

L' amorces SEQ ID .NO: 119 est specif ique de la region env 
du clone CI 5. 

; o r y t Un produit d 1 amplification de 1673 pb a ete 
obtenu et clone de la fa^on suivante : 
30 ; VADN tamplif i6 a ete insure dans un plasmide a l'aide du 
kit TA Cloning®. Les differentes etapes ont ete realisees 
conformement aux instructions du kit TA, Cloning® 
; (Jnyijtrogen)> A la fin de la procedure, \ les colonies 
blanches de bacteries recombinantesr (white) ont ete 
35 repiqufees pour etre cultiyees et permettre l 1 extraction 
des plasmides incorpores -selon la procedure dite de 
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"miniprep" . La preparation de plasmide de chaque colonie 
recombinante a ete coupee par une enzyme de restriction 
appropriee et analysee sur gel d 1 agarose. Les plasmides 
possedant un insert detecte sous lumiere UV apres marquage 
5 du gel au bromure d * ethidium, ont > ete select ionnes pour le 
sequen?age^ de 1' insert, apres hybridation avec une amorce 
complementaire <4u promoteur T7 present sur le plasmide de 
clonage du TA cloning kit®. La reaction prealable au 
sequengage a ensuite ete eff ectuee selon la methode 

10 preconisee pour - 1 ' utilisation du kit de sequengage 
"PRISM™ Ready Reaction AmpliTaq® FS, DyeDeoxy™ 
Terminator" (Applied Biosystems, iref . 4021191) et. le 
sequengage automatique a. ete realise sur les appareils 
373 A et 377 Applied Biosystems, selon les instructions du 

15 fabricarit. 

Le clone 5M6 obtenu, contient une region 
correspondant a la region 31 de l'enveloppe de MSRV-1, de 
492 pb suivi-des regions U3 , R et U5 (837 pb) de MSRV1-. 

i Le clone 5M6 est represents par sa sequence 

20 nucleotidique (SEQ ID NO: 120) dans la figure 5. Les trois 
trames de lecture potent ielles de ce clone sont presentees 
par leur sequence amineoacide sous la sequence 
nucleotidique i La trame de lecture correspondant a 
l!.extremite C-terminale de la proteine;; env MSRV-1 est 

25 identifiee par SEQ ID NO: 121. 

EXEMPLE 4 : OBTENTION DU CLONE LB16 GONTENANT LA REGION 
CODANT L 9 INTEGRASE DU RETROVIRUS MSRV-1 

.Une RT-PCR ',a ete eff ectuee sur de 1 ' ARN total 
30 , traite a la DNAsel et extrait a partir d'uh plexus 
chproide . provenant d'un patient. SEP. La synthese de cDNA a 
6te realisee avec une amorce oligo dT et la- transcriptase 
inverse VExparid™ RT" de; rBoehringer selon les conditions 
preconisees par la societe. Un contrdle "no RT M • a 6te 
35 effectue parallelement sur le mime materiel. Une PCR a ete 
eff ectuee avec la Taq polymerase (Perkin Elmer) sous les 
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conditions suivantes : 95°C 5 min puis 95°C 1 min, 55°C 1 
. min, 72 °C 2 min pendant 3 5 cycles et 72 °C pendant 8 min et 
avec un volume reactionnel final de 50 ill. 
Amorces utilisees pour la PGR: 
5 - amorce 5', identifiee par SEQ ID NO: 122 
5' GGC ATT GAT AGC ACC CAT CAG 3 1 ; 
- amorce 3', ideritifiee par SEQ ID-NO: 123 
5 ' .CAT GTC ACC AGG GTG GAA TAG 3* 
/ L'amorce SEQ ID NO: 122 est specif ique de la 

10 ■; region pol de MSRV-1 et plus precisement similaire a la 
region integrase- decrite- precedement. L'amorce 
SEQ ID NO 123 a ete def inie sur des sequences des clones 
obtenus lors d'essais prealables. 

Un produit d 1 amplification d 1 environ 7 60 pb a ete 
15 obtenu uniquement dans I'essai avec RT et a ete clone de 
• v . la fa?:on suivante : 

l'ADN amplifie a ete insere dans: un plasmide a l*aide du 
kit ; TA Cloning®. Les differentes etapes ont ete realisees 
r : conf ormement aux instructions du kit TA Cloning® 
20; (Invitrogen) . A la ; fin de la procedure, les colonies 
blanches i de bacteries reicombinantes (white) orit ete 
:;repiquees pour etre* cultivees et permettre 1' extraction 
des plasmides iricorpores selon* la ' procedure dite de 
"mihiprep". La preparation de plasmide de chaque eolonie 
25 recombinante a ete coupee par urie enzyme de restriction 
appropriee et analysee sur gel d • agarose . Les plasmides 
'"3 ^pbssedaht un insert detecte sous lumiere UV apres marquage 
du gel au bromure d ''ethidium, ont §te selectionnes pour le 
. T sequen<?age de 1 1 insert; apres hybridation avec une amorce 
30 :Compl6mentaire du promoteur T7. "present- sur le plasmide de 
1 clonage" 'du TA cloning- kit®. La reaction prealable au 
s6quen<?age a ensuite 6fc<§ effectuee selon la* methode 
pr6conis6e pour -1 ^utilisation du kit-*- de sequenq:age 
- "PRISM™ Ready Reaction ' AmpliTaq® FS, Dyebeoxy™ 
35- Terminator" (Applied Biosystfems, ref. 402119)- et le 
sequengage automatique a ete * realise sur: les appareils 
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373 A et 377 Applied Biosystems, selon les instructions du 
fabricant. 

lie clone LB 16 obtenu, contient les sequences 
correspondant a I'integrase. La sequence - nucleotidique de. 
5 ce clone est identifiee par SEQ ID NO: 124 sur la figure 

11 , trois trames~de lecture sont determinees. 

% 

EXEMPLE 5 : OBTENTION D/UN CLONE 2, CL2 , CONTENANT EN 

3 1 UNE PART IE HOMOLOGUE AU GENE POL, CORRESPONDANT AU GENE 
10 PROTEASE, ET AU GENE GAG . (GM3 ) CORRESPONDANT A LA 
NUCLEOCAPSIDE, ET UNE NOUVELLE REGION 5 1 CODANTE, 
CORRESPONDANT AU GENE GAG PLUS SPECIFIQUEMENT LA MATRICE 
ET LA CAPS IDE de MSRV-1. 

15 Une amplif ieation par PCR a ete effectuee sur de 

1 1 ARN total extrait a partir de 100 fil de plasma d'un 
patient atteint de SEP. Un temoin eau, traite sous les 
memes conditions, a ete utilise comme controle negatif . La 
synthese de cDNA a ete reetlisee avec 3.00 pmole d'une 
20 amorce aleatoire (GIBCO-BRL, France) et la transcriptase 
inverse "Expand RT" ■. (BOEHRINGER MANNHEIM, France) selon 
les conditions preconisees par la societe. Une 
amplif ieation par : PCR X" polymerase chain,, reaction w ) a 
&te effectuee avec L' enzyme Tag polymerase' (Perkin Elmer, 
25 France): ; en utilisaht 10 fi-1 de cDNA sous- k les conditions 
suivantes: 94°C 2 min, 55°C 1 min, 72°C 2 min puis 94°C 
1 min; 55°C 1 min, 72°C 2 min pendant 30 cycles et 72°C 
pendant .7 min^. et . avec un volume * react ionnel f inal de 

50,./il.. : ru* . -*r ^ •. V- 

30 Amorces utilisees pour 1 1 amplif ieation par PCR: „ 

- amorce: 5 1 identifiee par SEQ ID NO: 126 

5 * CGG ACA TCC AAA GTG ATG GGA AAC G 3» ; : 

- amorce 3 identifiee par SEQ ID NO,: 127 c 
5 1 GGA CAG GAA AGT AAG ACT GAG AAG GG 3 ' 

35 Une deuxieme amplification par PCR dite "semi- 

nxchSe" a ete. realisee avec une amorce 5' situee a 
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l'interieur de la region deja amplifiee. Cette deuxieme 
PCR a ete effectuee sous les memes conditions 
experimentales que celles utilisees lors de la premiere 
PCR, en.utilisant 10 pi du produit d 1 amplification issu de 
5 la premiere PCR.- 

Amorces utilisees pour 1 'amplification par PCR. semi- 
nichee: 

- amorce 5 • , identif iee par SEQ ID NO: 128 

5 1 CCT AGA ACG TAT TCT GGA GAA TTG GG 3» ; 
10 - amorce 3 1 , identif iee par SEQ ID NO: 129 

5 • TGG CTC TCA ATG GTC AAA CAT ACC CG 3 1 

Les amorces SEQ ID NO: et SEQ ID NO: sont 

specif iques de la region pbl, clone G+E+A, plus 

specif iquement la region E: position nucleotidique n° 423 
15 a n? 448* Les amorces utilisees dans la region 5' ont ete 

definies sur des sequences de clones obtenus lors d'essais 

prealables. 

Un produit d 'amplification de 1511 pb a ete 
obtenu a partir de 1'ARN: extrait du plasma.de patient SEP. 

20 Le fragment correspondant n'a pas ete observe pour' le 
* temoin eau. Ce produit d ' amplification a ete clone de la 
fa<?on suivaLnte. 

L'ADN amplifie a ete insere dans un piasmide a 
I'aide du kit TA Cloning™. Les 2. /xl/de solution d'ADN ont 

25 et§ melanges avec : 5 pi d ' eau distillee sterile^ 1 /zl d'un 
tampon 'de ligation 10 fois concentre "10X'- LIGATION 
BUFFER", 2 /il de- "pCR™ VECTOR" (25 ng/ml) et 1 |il de "T4 
-DNA LIGASE" . Ce melange a 6te ineube une nuit a 14 °C. Les 
etapes suivantes ont ete rea Usees conformemerit au 

30 instructions du kit TA Cloning® (Invitrogen) . Le melange a 
6te etale apres transformation de la ligation dans des 
bacteries E* cdli. INVaF 1 . A la fin de la procedure, les 
colonies blanches de C .bacteries recombinantes ont ete 
repiquees pour< etre cultivees et permettre 1' extraction 

35 des plasmides incorpor€s selon . la " procedure dite de 
"minipreparation d'ADN'? (17) . La preparation de piasmide 
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de chaque colonie recombinante a ete coupee par 1* enzyme 
de restriction Eco RI et analysee sur gel d 1 agarose. Les 
plasmides possedant uri insert detecte sous lumiere UV 
apres marquage du gel au bromure d 1 ethidium, ont ete 
5 selectionnes pour le sequengage de l 1 insert, aprds 
hybridation avec une amorce complementaire du promoteur T7 
present sur le plasmide de clonage du TA cloning kit®. La 
reaction prfealable au sequenqiage a bnsuite ete effectuee 
selon la- methode -preconisee- pour 1 •utilisation du kit de 

TM 

10 sequengage "PRISM Ready Reaction Amplitaq® FS, 

TM 

DyeDeoxy Terminator" (Applied Biosystems, ref. 402119) 
et le sequen<?age automatique a ete realise sur les 
appareils 373 A et 377 Applied Biosystems > selon les 
instructions du fabricant. 

15 Be clone obtenu, denomme GL2 , contient unei region 

C-terminale similaire a la region 5' terminale des -clones 
G+S+A de MSRV-l, qui permet: de definir la region C- 
terminale du gene gag et une nouvelle region 
correspondante a la region J N- terminale du gene gag MSRV-l. 

20 CL2 permet de definir tine region de 1511 pb 

presentant une phase ouverte de lecture dans la region N- 
terminale de 1077 pb codante pour 3 59 acides: aminees et 
une phase non-ouverte de lecture, de 454 pb, correspondant 
- S la ^r6gion C-terminale Tdu gene gag MSRV-l. 

25 - ^La ; sequence nucleotidique de CL2 est identifier 

par SEQ ID NO:: 13 0. Elle est representee a la figure 6, 
avec les ' trames de lecture potentielles en amiriebacide. 

: Le ^fragment de > 1077 pb de CL2 codant pour 359 

r acides amines- a ete amplif ie par PCR avec 1* enzyme Pwo 

30 (5U//il) (Bbehririger ^Mannheim, Fraricie) en utilisant de 
la. minip^ireparation ide 1 1 ADN du clone 2 sous les conditions 
suivantesi 95°0 1 min, 60°G 1 min, 72°C 2 mih pendant 25 

? cycles let ayeeviun volume reactionnel final de 50 /zl a 
1* aide des amorces: 

35 - amorce 5' (Bam HI) , identifie par SEQ ID NO: 132 
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5' TGC TGG. AAT TCG GGA TCC TAG AAC GTA TTC 3' (3.0 mer), et 

- amorce 3' (Hind III), identifie par SEQ ID NO: 133 

5 AGT TCT GCT CCG.AAG CTT AGG CAG ACT TTT 3' (30 mer) 

correspondent, respectivement, a la sequence nucleotidigue 

5 du clone 2 en position -9 a 21 et 1066 a 1095. 

Le fragment obtenu apres PGR, a ete linearise par 

Bam HI et; Hiiidlll , et sous-clone dans les vecteurs 

d' expression pET28C et , pET21C- (NOV AG EN) linearise par Bam 

HI et Hind III; Le sequengage de 1 1 ADN du fragment de 1077 

10 pb du clone 2 dans les deux vecteurs d 1 expression a ete 

realise selon. la methode preconisee pour 1 1 utilisation du 

TM 

kxt de sequenfage "PRISM ( . ; Ready Reaction Amplitaq® FS, 
TM 

DyeDeoxy Terminator" (Applied Biosystems, ref . " 402119) 
et le sequengage automatique- a ete .realise sur les 

15 appareils 373 A et 37.7 Applied Biosystems, selon les 
instructions du fabrieant. 

L 1 expression a de * , la - sequence . nucleotidique du 
fragment de 1077 pbr du ; clone ;£2 par les vecteurs 
d» expression pET28C et . - pET21C sont identif iees ' par 

20 respectivement SEQ ID NO: - 135 et SEQ ID NO: 137 . ■ 

EXEMPLE 6 : EXPRESSION DU CLONE 2 CHEZ ESCHERICHIA COLI 

Les constructions j pET28crclone- ;2 r (1077 ^pb) et 
25 pET21C-clpne 2< (1077 i ;pb) synthetisent, - dans la souche 
baeterienne BL21 (DE3) , une prdt e ine:. .en c .fusion. N- - et C- 
terminale pour le : Vecteur pET28C et C-Terminale pour le 
( vepteur -? pET2 1G: avec 6 Histidines ; de masse moleculaire 
. /. apparent e d!envirpn ;: 4 5 ^r kDa^n- mise , en . Evidence:'? par 
30 electrpphpr&se sur gel*, de polyacry lamide tSDS-PAGE . (SDS = 
Dpdecyl Sulfate de . Sodium):, J (Laemmli, :r!9^0 (1))V La 
r : reactivite de la proteine a ete mis.e eri; evidence vis a vis 
d^urv anticprps:- monoclonal ^anti-Histidine:v(DIANOVA) par la 
technique de Western blot (Towbin et al. , :;.-197-9- (2) ) ♦ 
35 Les proteines recombinantes pET28c-rclone 2 

(1077 pb) et pET21C-clone 2 (1077 pb) ont ete visualisees 
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en SDS-PAGE dans la fraction insoluble apres digestion 
enzymatique des extraits bacteriens avec 50 ^1 de lysozyme 
(10 mg/ml) et lyse par ultrasons. 

Les . propri§tes : antigeniques des antigenes 
r 5 recombinants pET28e-clone 2 (1077 pb) et pET21C-clone ; 2 
(1077 pb) ont ete testees par West en Blot () .apres 
solubilisation du culot bacterien avec 2% SDS .et 50 : mM fi- 
ller cappoethanol * Apres incubation . avec les sequins de 
patients^ ^tteints de sclerose en plaques, les serums des 
10, temoins neurologiques et les serums de temoins ;de ^centre 
de transfusion sanguine^ (CTS) , les immunocpmplexes ont ete 
detectes a l'aide d'un serum de chevre anti-IgG et : anti 
IgM humaines, couple a la phosphatase alcaline. , >fc/ . 

Les resultats sont presentes dans le tableau ci- 
15 apres. . ... ,. . . . ; . . ... . , c - 

- . TABLEAU. ;KC 

Reactivite de serums atteints de sclerose en plaques et 
temoins avec la proteine recombinante MSRV-1 gag clone 2 
(1077 pb) = pET21C-clone 2 (1077 pb) et pET28C-clone 2 
20 , . ■ t (1077 pb) a ^ ^ ; 

V MALADIE - NOMBFtE ) ; ;NOMBRE " | J; * 
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specif ique sur les bandelettes. Des controles positifs et 
negatifs sont incliis dans chaque experience. 

" Ces resultats montrent que/ dans les conditions 

5 techniques utilis§es, environ' 40% des serums hiimains 
atteints de sclerose en plaques testes reagissent avec les 
proteines recombinarites pET28C-clone 2 (1077 pb) et 
pET21C-clone : 2 (1077 pb) . tJne reactivite a et4" observee 
sur un temoih neurologique et il est interessant de~noter 

10 " que les ARN extraits a partir de ce - serum, apres 1 ' etape 
de transcriptase inverse, sont aussi amplifies par PCR 
" - dans la region pol ; Ceci suggere que des persorines n 1 ayant 
pas declare une SEP peuvent egalemfent Heberger et exprimer 
ce virus. Pair contrei, un t&moin (dbnnelir CTS) apparemment 

15 sain, possede des anticorps anti-gag (clone 2, 1077 pb). 
Ce qui est compatible avec un immunite acquise contre 
MSRV-1 en dehors d'une maladie autoimmune associee 
declaree. r - ■■ •■*": ■ ■ * 

20 EXEMPLE 7 : OBTENTION D*UN CLONE LB13 CONTENANT EN "3 ' 
UNE PART IE HOMOLOGUE AU CLONE 2 CORRESPONDANT AU GENE GAG 
ET EN 5' UNE PARTI E HOMOLOGUE AU CLONE 5M6 CORRESPONDANT A 
<-;. ISA REGION ;LTR 3J5l. .\ 

25) t Une RT-PCR ( u reverse-transcriptase-polymerase 

chain reaction w ) a 6te effectuee a partir de l v ARN total 
extrait de virions, provenant de surnage'arits de cellules 
lymphocytaires B des patients atteints de sclerose en 
plaques; concentres par ultracentrifugations.! La syrithese 

3 0_ de „cDNA a 6te, .realisee ayec une amorce , spe.cif ique. .SEQ N° 

TM 

•v XXX Let ; la transcriptase inverse" Expand RT " de 
BOEHRINGER MANNHEIM' selon les conditions £recohisees par 
la soeiete^- - " " - ' - 

> Amorce utilisee pour la syhthese dii cDNA> idehtifiee par 
35 SEQ ID NO: 138 : 

5 1 CTT GGA GGG TGC ATA ACC AGG GAA T 3 1 
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Une amplification par PGR a ete realisee avec la 
Taq polymerase (Perkin Elmer, France) sous les conditions 
suivantes: 94°C 1 min, 55°C 1 miri, 72°C 2 min pendant 35 
cycles et 72 °C pendant 7 min et avec uri volume r&actionnel 
5 f inal de 100 /til. 

Ambrcies utilis^es pour 1* amplification pair PCR: : 

- amorce 5 • ■, "identif iee par SEQ ID NO: 139 
5 • TGT CCG CTG TGC TCC TGA TC 3 1 

- amorce 3*; identifiee par SEQ ID NO r 138 
10 5' GTT GGA GGG TGC ATA ACC AGG GAA T 3' 

Une deuxieme amplification par PCR dite w semi- 
nichee " a ete realisee avec line amorce 3 * situee a 
l'int€rieur de la region deja amplifiee. C£tte deuxieme 
amplification a Sfce effeetuee sous les memes contitions 
15 experimentales que celles utilisees 'lors de la premiere 
amplif icatidn, en utilisant 10 m*1 du ptoduit 
^•amplification issu de la premiere : PCR • 

Amorces utilisees por 1 'amplif icatioh par PCR ~ " semi- 
nichee - : ■ >■ . 

20 - amorce 5 r , identifiee par SEQ ID NO: 139 
5 V. TGT CCG CTG TGC TCC TGA TC 3 '*' 

- amorce 3 1 , identifiee par SEQ ID NO: 140 
■5?\CTA TGT CCT TTT GGA CTG TTT GGG T 3' 

lies 'amorces SEQ ID NO: 138 et~ SEQ ID NO: 140 sont 
25 specif iques de la region *jag, * clone 2 position 
nuclebtidique n° 373-397 et n 6; 433-456. Les • amorces 
utilisees Mans la region 5 1 ont ete def inies stiir des 

- sequences ^des clones obtenus lors d ' essais prealables. 

Uri produit d 'amplif ication de 764 pb a ete obtenu 
30 et clone de la f a?on suivante: r^: ; 

- - ■ v - L'ADN amplif ie r a ete ihsere dans un plasmide a 
- !• aide du-kit'TA Cloning^. Les 2' Ml 'de solution d • ADN ; ont 
" i 6te melanges aivec 5; Ml d r eau distillee^ sterile, 1 m : 1 d'un 

tampon de ligation 10 fois concentre "10X LIGATION 
35 BUFFER", 2 Ml de "pCR T ^ VECTOR" (25 ng/ml) et 1 m1 de "T4 
DNA *LIGASE". Ce melange a eti§ incube une nuit a 14 °G. Les 
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etapes suiyantes qnt ete realisees cpnf ormement au 
instructions du kit TA Cloning® (Invitrpgen) . 1: Le melange a 
ete etale apres transformation de la ligation dans des 
Ipacteries E. coll INVaF' . A la fin de la procedure, les 
5 colonies blanches de bacteries recombinantes ont ete 
repiguees pour etre . cultivees et permettre. 1* extraction 
des plasmides incorpores selon la procedure dite de 
"minipreparation d 1 ADN" (17) . La preparation -de; plasmide 
de chague colonie recombinante a ete- eoupee par l 1 enzyme 

10 de restriction Eco RI et analysee sur gel d ' agarose .. Les 
plasi^ides possedant un insert detecte sous lumiere UV 
apres marguage du gel au bromure d'ethidium, ont; ete 
selectiorines pour le seguentpage de 1" insert, apres 
hybridation ,avec une -amorcej : cpmpleinentaire du promoteur T7 

15 present sur le plasmider de ; clonage du TA cloning kit®. La 

reaction prealable au seguenpage a ensuiite ;ete effectuee 

selon la methode precpnisee .pour 1' utilisation ;du^ }cit de 

TM 

sequen<?age ; "PRISM Ready; Reaction Amplitaq® FS, 

TM 

DyeDeoxy Terminator" (Applied Biosystems, ref . : 402119) 
20 et le sequenpage automatique . a ete realise sur les 
appareils 373 A et 377 Applied Biosystems, selon les 
instructions du fabricant. — 

Le clone LB13 pbtenu ppntient urve;_ region N- 
v.ff .;terminale ide^ gene gag MSRVrl hoinologuer au clone 2 et un 
2f> LTR W correspondent a , une partie de la region , U5 . Entre la 
region OJ5 et gag un site de fixation pour les ARN de 
trainsfert,: le PBS w primer binding site - a ete identifig. 
^ :>/ < Cf , La. , sequence, nucleptidique du fragment des, ,764 pb 

TM 

f , ... ;dft ^lone LB13 dans le plasmide : , H pGR vector" est 
30 representee dans 1 • identif icateur- SEQ ID WO: 141. i 
, : Le site de .fixation pour les ARN de transfert, 

, r? presentant une sequence du type PBS ..tryptophane, a ete 
: , M : i dent i fie en r position nucleo^idique , -n?.3 4 2-35-9; <*u .clone 
- -LB13. .. : : ., • 

35- . „ t Comme ce. meme PBS - ;a; €t6 r,etrquv§ dans les copies 
endogdnes homologues a MSRV1, la r f amille endogene; ainsi 



WO 99/02666 



PCT/FR98/01460 



35 

definie est appelee dorenayant HERV W, selon la 
nomenclature proposee pour les families de retrovirus 
endogenes (W=Tryptophane) . 

Une ORF rcourte d^environ 65 . aeides amines a ete 
5 retrouvee dans la region U5: du LTR 5* du 1 clone LB13 . 

Sequence de 1 *GRF : 
PMASNRAITLTAWSKIPFLGIRETKNPRSENTRLATMLEAAHHHFGSSPPLSWELWEQ 
GPQVTIW. 

10 La sequence - nucleotidique correspondante debutant 

par un codpn ATG est susceptible d'etre exprimee dans un 
ADN sous genomigue a partir d'un LTR proviral (U3RU5)^. ; 

Un autre clone, denomm^ LAI 5 a ete obtenu sur 
15 1 1 ARN total extrait de : virions concentres par 
ultracentrifugation a partir d'un surnageant de culture de 
synoviocytes prpyenant rdfun y patient atteint de 
polyarthrite rhumatoide. La strategie d 1 amplification et 
clonage du; clone eLA15/ .est: .exactement r l^ meme qui a ete 

20 utilisee pour le clone LB13. -; . ^ v ^ ^ 

d : La sequence nucleptidigue du clone LA 15 qui est 

representee dans 1 • identif icateur SEQ ID NO: 142, est tres 
similaire au clone LB13 . Ceci suggere que le retrovirus 
MSVR-1 d^tecte dans la sclerose en ^plaque -presente des 
25 sequences similaires a r celles reneqntrees dans la 
polyarthrite rhumatpide. • . " *, 'j ; 

EXEMPLE 8 : RECONSTRUCTION D 'UNE REGION RU5-GAG A PARTIR 
DES CLONES LB15, LB13 , CL2 ET CL17 V;Jr: . ^ j 

30 ,. , t , - - , , ■ - • ■ -i ' -. • \: 

Les clones CL2 et LB13 ont deja ^te ; dj§crits 3 dans 
les exemples precedents. Le clone LB15 a ete obtenu en 
utilisant la sequence rR^du LTR; du , clone QlS, pour definir 
: une amorce e;n 5 1 et les amorces anti^sens L v utilisees sont 

35 les memes que pour le clone LB13. Le clone CL17 a et€ 
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obtenu par nested RT-PGR en utilisant les amorces 
suivantes : 

5 • -TCATGCAACTGCACTCTTCTGGTCCG-3 • (sens) 
5 5 • -TCTTGCACTAACCTCCACTGTCCGTTGG-3 ; ( anti-sens) 

5 ' -ATCc6cCAGTAACAATTTGGTGAGCACG-?3 • ( sens) 

5 • -TCGGGTGTAAGAGGGTACTTCGTTTGGTAGG- 3 • 1* ( anti-sens) 

10 Le clone LB15 a §te obtenu a partir de virions , 

obtenus par culture de cellules de SEP, Le clone LB17 a 
ete obtenu a partir de culture de plaisma de patient 1 SEP . 

Ces clones chevauchants ont permis de reconstruire 
<■ une sequence RU5-gag avec line ORF potential le dans le gene 
15 gag, telles que presentees a la figure 14 J ■ 

EXEMPLE 9; OBTENTION D'UN CLONE 87-2 3 ' 

La region correspohdarit a 1 1 integrate 0 a ete 
20 amplifiee et clonee a partir de plasma de SEP eh utilisant 
line semi-neistfed RT PGR avec les amorces suivantes situees 
dans les regions pol et eriv de MSRV1. 

^ ^ Dans ^a region por : r>> o 

25 5 — TTACGG AGGTCTC AGGG ATG AG CTT - 3 • (sens-PCR prima ire) 

5 1 — CGGCAGTAGCAGTCTTAGTATCTGAAGCAGTTA— 3 1 (sens-PCR ^ 

secondaire) 

Dans la region env, ^ 
30 5 1 -GGTACGGAGGGTTTCATGTAGTTTTGAG-3 1 (anti-sens PCR primaire 
^- n et ; secohdaire) ~" ' 

"■° lie -clohle r amplifi4 eomporte 774 pb dans lk- region 
^ pol/RT, toute la region integrate (1197 pb) et le debut de 
35 la r * region env ; (480 pb) . - La sequence nucleotidiqiie 
correspondant a la region int^grase et la traduction en 
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acides amines de l'ORF potentiel sont presentes a la 
figure 15. 

EXEMPLE 10 : CONFIRMATION DE LA PRESENCE D'ARN CONTENANT 
5 DES SEQUENCES ENV APPARENTEES A ERV9 DANS LES PARTICULES 
RETROVI RALES ASSOCIEES AU GENOME MSRV1 : 

Des sequences apparentees a ERV9 ont ete trouvees 
en proportion iriinoritaire dans les preparations du virion 

10 provenant de SEP par rapport aux sequences MSRV1. 
L 1 existence de phenomenes de co-encapsidation de sequences 
endogenes phylogeniquement proches dans des particules 
retrovirales produites par une souche replicative a ete 
decrite. De maniere surprenante une region d'ARN - 

15 comprenant une ORF commen?ant dans la partie 3 1 d'env et 
se continuant potentiel lement dans le LTR3 1 a ete 
retrouvee dans differents echantillons de SEP. Afin de 
preciser l 1 existence d'une ORF des essais de 
transcription-traduction ont ete realises et ont permis de 

20 montrer la realite d'une ORF env contenant toute la partie 
transmembranaire (TM) et se terminant au debut du LTR 
putatif . Cependant une trame additionnelle (ORFX) fait 
suite et se poursuit dans le LTR3 ' * Les deux produits 
d» expression ont 6t6 visualises et leurs ORFs respect ives 

25 ont ete sous donees. La figure 16 represente les 
sequences nucleotidique et peptidique du clone B13 deja 
decrit en precisant les ORFs dans la region env tronquee 
et dans le LTR putatif. La presence de tels ARN peut etre 
S l f origine de recoxabinaisons avec la souche replicative 

30 et par consequent generer des souches de pathogenecite 
modifiee. 



WO 99/02666 



PCT/FR98/01460 



38 

BIBLIOGRAPHIE 

(1) Laemmli U. K. Cleavage of structural proteins 
during the assembly of the head of bacteriophage T4. 

5 Nature. (1970)-. 227: 680-685. 

(2) Towbin 1 H. , Staehelin T. & Gordon J. 
Electrophoretic transfer of proteins from -polyacrylamide 
gels to nitrocellulose sheets: procedure and some 

10 applications. Proc. Natl. Acad. Sci. USA. (1979). 76: 
4350-4354. 



WO 99/02666 



PCT/FR98/01460 



39 

REVEND I CAT I ONS 

■ y: . 1. Materiel nucleique, a 1 • etat isole ou purifie, 

comprenant !p une sequence nucleptidique . choisie dans le 
5 groupe qui consiste en (i) les sequences SEQ ID NO: 112, 
? SEQ ID NO: 114, SEQ ID NO: 117, SEQ ID NO: 120, 

SEQ ID NO: 124, * . SEQ ID NO: 130, SEQ ID NO: 141 et 

SEQ ID NO: 142 r ; (ii) les sequences complementaires des 
sequences (i) ; et (iii) les sequences equivalentes aux 
1() sequences,^ (i) ou (ii) , en particulier les sequences 
presentant pour toute suite de- 100 monomeres contigus, au 
moins 50 %, et pref erentiellement au moins 70 % 
d? hoitiplog.ie avec respectivement les sequences (i), ou (ii)..... 

2. Materiel nucleique, a I'etat isole ou purifie, 
15: ; codant pour un polypeptide presentant, pour toute suite 
con^igue .d^u moins 30 acides amineees, au moins 50 %, et 
de preference, au moins 70 % dMiomologie, avec une sequence 
peptidique choisie, dans le . groupe qui consiste en 
SEQ IP NO: li3, , SEQ ID N°115, SEQ ID NO: 118, 

20 .SEQ.ID NO ( : 121,,, SEQ ip NO: 135 e£, SEQ ID NO: 137.. 

< 3.. ti M^teri(Bl nucleique retroviral, dont le t gene pol 
Qompren-d, une., sequence nucleotidique identique ou 
c eqjuiyplen^e , a une. sequence choisie dans le groupe qui 
consisite^ , r en ( SEIQ. ID NQ.:, ^12 , .SEQ , ID NO:, 124, et leurs 
25 . sequences qqmplementaires : . . . , , 

r ^ ;n rr 7^ Materiel nucleique . r^trpyiral , dont 1 1 extremity 
•Tm.L-o:^ u P? 1 commence au nucleotide 1419 de 

r SEQ ID, NO: ,130, 

5. Materiel nucleique retroviral, dont le g#ne pol 
}?. k sRS u ? Polypeptide presentant, pour toute suite 

cont^ acides amineees, au moins 50 %, et 

I ^ ^d^prefer^ce au moins 70 % d'homologie, avec la sequence 
, 7 peptidique^SEQ ID Np: 113 . 

. 6. u Materiel nucleique retroviral, dont 1» extremity 

5 3 f du g&ne gag finit au nucleotide 1418 de SEQ ID NO: 130. 
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7. Materiel nucleique* retroviral, dont le gene env 
comprend une sequence nucleotidique identique ou 
equivalente a une sequence choisie 'dans" le groupe qui 
consiste en SEQ TD NO: 117, "et ses sequences 

5 compleraentairesC r ' 

8. Materiel hucl&ique retroviral , dont le gene env 
comprend une— Sequence nucleotidique qui commence au 
nucleotide 1 de SEQ ID NO: 117 et finit au nucleotide au 

- nucleotide 23 3 de SEQ ID NO: 114* - 

10 9. Materiel nucleique retroviral, dont le gene env 

code pour un- : jpolypeptide presehtaht, pour toiite suite 
contigue d'au moins 30 acides amineees, au moins 50 et 
de preference afa ihoihs 70 % d'homblogie, avec la sequence 
SEQ ID Nd:°ii8. : * 

15 ^ 10\ ; Materiel nticleiqvie retroviral dbrit la iregion 

U3R du LTR 3* comprend une sequence nucleotidique qui se 
— termihe au nucleot:fde f ^6l 1 7 de SEQ ' ID NO: ir4~. " r 

11 i Materiel ' nucleique retroviral dbrit la region 
RUf> du LTR 5* comprend une sequence nucleotidiquis qui 

20 commence ; au - nucleotide^ 755 de SEQ ID NO: 120 et firiit au 
nuclebtide 337' Se 1 SEQ J ib NO: i41 r ou SEQ ID NO: 142. 

- 12. MaferxeY' nucleique' retroviral comprenant une 

sequence qui ; ' cbmmehce ; aii nuclebtide 755 de SEQ ID' Nb: 120 

" •■-"'-•'et qui se tefininy -air nucleotide 617 de* SEQ ID NO r ? x fl4V 

25 13. Materiel nucleique retroviral' selbn H ^i , une 

quiilSohque cles' revendrca^ characterise en 

ce qu'il - est associe a au moins une maiadie auto-immune 
telle que la sclerose en plaques ou la polyarfchrite 
rhama£oide9 r -' -- /fr;oi: ^' : : y ' - p 

3t 0 - • i4 i L Fragm%ht Hr nuci6ot^ urie s6qiience 

nucleotidique choisie r dans le groujpfe qui cbhsiste en (i) 
les sSquenc^S r : r> SiEQ ID NO : 112, SEQ lb v NO^ y 114, 

SEQ ID NO: 117, SEQ ID NO: 120, SEQ ID" NO : 124 , 

SEQ ID NO: 130, SEQ ID NO: 141 et SEQ ID NO: 142 ; (ii) 

35 les sequences xoVlewentaires des sequences (i) ; et (iii) 
les sequences #quivalentes aux sequences (i) ou (ii) , en 
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particulier les sequences presentant pour toute suite de 
100 monomeres contigus, au moins 50 %, et 
pref erentiellement au moins 70 % d'homologie avec 
respectivement les: sequences .(-'i")----OU.o(.i'i:):..:. 
5 15. Fragment nucleotidique selon . la revendication 

14:, consistant en une sequence nucleotidique choisie dans 

x 

le groupe qui cqnsiste en (i) les sequences 
SEQt- ID NO: 112 , SEQ ID NO : 114 , SEQ ID NO : : 117, 

SEQ ID NO: 120,- SEQ ID NO: 124, SEQ ID NO: 130, 

10 SEQ ID NO: .141 et SEQ ID NO: 142 ; (ii) les' sequences 
compl-ementaires des sequences (i) ; et (iii) les sequences 
equivalentes aux sequences (i) ou (ii) , en particulier les 
sequences presentant pour toute suite de 100 monomeres 
contigus, au moins 50*%, et pref erentiellement . au moins 

15. 70 % d'homologie avec respectivement les sequences ( i) ou 
(ii). 

16. Fragment nucleotidique comprenant une sequence 
nucleotidique codant pour un polypeptide presentant, pour 
toute suite contigue d'au moins 3 0 acides amineees, au 

20 moins 50 %, et de; preference au moins. 70 % d^omologie, 
avec une Cseq.uence peptidique choisie dans, le :groupe qui 
consiste en SEQ ID NO: 113, SEQ ID N°115, SEQ ID NO: 118, 
SEQ ID NO: i;121v 'SEQ ID NO: 135 et SEQ : ID NO: :: 137. 
:. :17u. ^Fragment nucleotidique selon : la revendication 

25 16;. consistant en une sequence nucleotidique; - s .codant pour 
tin polypeptide^ presentant, pour toute suite contigue d'au 
moins 30 acides amineees, au moins 50 %, et de ^preference 

: ( au .; moins £7:0 ^%*l:d^homologie, avec une sequence peptidique 
' choisie dans fie groupe jqui consiste. en SEQ ID NO,: 113, 

30 SEQ ID, N? 115:; SEQ ID NO: 118, SEQ ID NO: 121, 

-SEQ ID NO : f 1 3 5: et ; S EQ ID NO: 137. . .: r . 

^ 18; Sonde: nucleique pour. la detection d*un 

retrovirus associe, a la -sclerose en plaques- et/ou la 
polyarthrite rhumatolde, caracterisee en ce quelle est 

35 susceptible de s'hybrider specif iquement sur tout fragment 
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; : selon I'une quelconque des revendications 14 a 17, 
appartenant au genome dudit: retrovirus. 

19. Sonde- selon la revendication 18, caracterisee 
en ce qu 1 elle possetfe de : lO: ; h 100 nucleotides, de 
5 preference de 10 a 30 nucleotides. 

- : 1: v 20 ; Amorce pour 1 ' amplification par polymerisation 

d'un ARN- ou d'un ADN ^d 1 un retrovirus associe a la sclerose 
en plaques et/ou la polyarthrlte rhumatolde,: caracterisee 
eh ce qu'elle cbmprend une sequence nucleotidique 

10 ddentique ou equivalents a au moins une.partie de la 
sequence Nucleotidique d'un .. fragment selon l'une 
i quelconque des revendications 8 a 11, notamment une 
sequence nucleotidique presentant pour toute suite de 10 
; • monomer es contigus, au moins 50 %, de preference au moins 

15 ^70 % : d'homologie avec au moins ladite partie dudit 
fragment. 

: - ^ 21 . Amorce selon la revendication 20, caracterisee 
; eh ce que, sai sequence nucleotidique est - choisie parmi 

- , SEQ; ID NO: 116, SEQ ID NO: 119, .SEQ ID NO : 122, 
20^ SEQ ICPNO: 123 p SEQ ID NO: 126, x SEQ EDiNO: 127, 
>P ^SEQ^ID ^6:^128, SEQ ID* NO: 129 , SEQ ID NO: 132/ et 

SE<J CtDJNC^: 133-;' T- : . ' -• s ic-.. ^ . 

* V L 22>J^ ARN !: " ou ADN,:. vet notammento vecteur de 
3A^^"r§pl=ica4:i^h -et/ou ^ d •expression, > comprenarit un fragment 
251 gehoiaique I d6 ^materiel nucleique selon lAurie . quelconque :des 
? reWhdicatioriis 1 ao? ou un f ragment , selon 1 1 une quelconque 
^ " ^ ^revenaicatibns W a 17. : "t: 

:rq^:j ^^ri^p^Peptl/de code par tout cadre 'de lecture ouvert 
: X appartenant^ ^ a ^ "tin ^ f fragment c c nucleotidique selon 1 1 une 
30 ^ufercorique des ^revendications 14 a: 17/ notamment 
polypeptide, par exemple oligopeptide ; formant ou 
J com^renahfe uri determinant ;aritigenique; reconnu par des sera 
- de ^ patients ihfefctes par fie avirus:aMSRV^l:>l - et/ou chez 
- lesquels^ le ^virus: MSRV-1 a ete ; reactive. - v : ; 
.350: / j ..-'■:> 24. Peptide selon la revendicatioh 23 comprenant 
une sequence identique, partiellement ou totalement, ou 
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equivalente a une sequence choisie parmi SEQ ID NO: 113, 
SEQ ID N°115, SEQ ID NO: 118, SEQ ID NO: 121, 

SEQ ID NO: 135 et SEQ ID NO: 137, 

25. Composition diagnostique, prophylactique, ou 
5 therapeutique, notamment pour inhiber 1' expression d'au 

moins un retrovirus associe a la sclerose en plaques et/ou 

r 

a la polyarthrite rhumatoide, comprenant un fragment 
nucleotidique selon l'une quelconque des revendications 14 
a 17. 

10 26. Procede pour detecter un retrovirus associe a 

la sclerose en plaques et/ou a la polyarthrite rhumatoide, 
dans un echantillon biologique, caracterise en ce que 1 1 on 
met en contact un ARN et/ou un ADN presume appartenir ou 
provenant dudit retrovirus, ou leur ARN et/ou ADN 

15 complementaire, avec une composition comprenant un 
fragment nucleotidique selon l'une quelconque des 
revendications 14 a 17. 
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FIG 2 
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QOITATAGAA QGAOQOCrAG TATGG33TAA TOOOCICIGG GAAAGCAAQC 50 
AYRR TPS M G SP LG NQA 

LIE GPLV WGN PL W ETKP 
L.K DP. YGVI PSG KPS 

OQCAGTACIC AGCAGGAAAA ATAGAATAGG AAAQCTCACA AGGACATACT 100 
PVL SRKN RIG NLT RTYF 
QYS AGK IE.E TSQ GHT 
PSTQ QEK N R KPHK DIL 

TKXJECQOCT OCAGATGGCT AQOCACIGAG GAAG3AAAAA TACTTTCADC 150 

PPL QMA SH.G RKN TFT 
FLPS RWL ATE EGKI LSP 
SSP PDG. PLR KEK YFHL 

TGCAGCTAAC CAACAGAAAT TACTTAAAAC OCTTCAOCAA AOCTTQCACT 200 
CS.P TEI T.N PSPN LPL 
A A N QQKL LKT LHQ TFHL 
QLT NRN YLKP FTK PST 

TAQGCATIGA TAGGAGQCAT CAGATGGGCA AATIATTATT TACIGGAQCA 250 
RH. HPS DGQ III YWTR 

GID STH QMAK LLF TGP 
. A L I API RWP NYYL LDQ 

QOOLTITILA AAACTATCAA GAAGATAGTC AG9Q3CTGIG AAGTGTGOCA 300 

PFQ NYQ EDSQ GL. SVP 
GLFK TIK KIV RGCE VCQ 
AFS KLSR R.S GAV KCAK 

AAGAAATAAT 
K K . 
R N N 
E I 
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FIG 2 (suite) 
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CCCIGTATCT TTAACCTOCT TGTIAAGTTT GTCTCTTOCA GAATCAAAAC 50 
PCIF N L L VKF VSSR IKT 
PVS LTSL LSL S L P ESKL 
LYL .PPC.VCLFQNQN 

TGTAAAACTA CAAATTGTTC TICAAATGGA GCACCAGATG GAGTCCATGA 100 
V K L QIVL QME HQ.M ESMT 
. NY KLF FKWS TRW SP. 
CKTT NCS SNG APDG VH D 

CTAAGATOCA GO3IGGA0QC CIX33AOCX33C CIGCEAOOOC AUGCIDOGAT 150 

KIH RGP LDRP ASP CSD 
LRST VDP WTG LLAH APM 
. DP PWTP GPA C.P MLR C 

GTIAATGACA TTGAAGGGftC QOCIOOOGAG GAAAICICAA CIGGACAADC 200 
V N D . I EGT PPE EIST AQP 
LMT LKAP LPR KSQ LHNP 
. .H .RH PSRG NLN CTT 

OCTACEAIGC COCAATICAG CX3GGAAGCAG TTAGAGCGGT CATCAGOGAA 250 
LLC PNSA GSS .SG HQPT 
YYA PIQ REAV RAV ISQ 
PTMP QF S GKQ LERS SAN 

OTICCCCAAC AGCACTIGGG TTTTOJlUl'l' GAGAG33QGG ACIGAGAGAC 300 

SPT ALG FSC. EGG LRD 
PPQQ HLG FPV ERGD . E T 
LPN STWV FLL RGG TERQ 

AGGACTAQCT GGATTTOC7EA GGCCAACGAA GAATCCCTAA GCCTAGCTOG 350 
RTSW IS. ANE ESLS LAG 
GLA GFPR PTK NP. A.LG 
D.L DFL GQRR IPK PSW 
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GAAGGTGACT GCATOGADCT CTAAACA.TQ3 GQCTIGCAAC TTAQCICACA 400 
KVT ASTS KHG A C N LAHT 
R . L HP P LNMG LAT . L T 
EGDC IHL . T W G L Q L *S S H 

GOOGAOCAAT CAGAGAGCIC ACIAAAATGC TAATEAQGCA AAAATAQGAG 450 

R P I REL T K M L IRQ K E 
PDQS ESS L. KC L G K NRR 

PTN QRAH N A N . A K. I G G 

GTAAAGAAAT AGOCAATCAT CTATIGOCIG AGAQCACAGC QQGAGGGACA 500 
VKK. PII YCL RAQR EGQ 
. R N SQSS IA. EHS GRDK 
K E I ANH LLPE STA GGT 

AQGATOQ33A TATAAAODCA G3CATTCGAG GOGGCAAQGG CAAGCCQCIT 550 
GSG YKPR HSS RQR QPPL 
DRD INP GIRA GNG N P L 
R I G I . T Q AFE PATA TPF 

'IGGGTIDOOCr OX'ITILt'IAT GG3CX3CICIG TITICACICT ATITCACICr 600 

GPL PLY GRSV FTL FHS 
WVPS LCM GAL FSLY FTL 
GSP PFVW ALC FHS ISLY 

ATTAAATCIT GCAACTGAAA AAAAAAAAAA AAAAA 635 
IKSC N.K KKK K 
LNL ATEK KKK K 
• IL QLK KKKK K 
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ATOGCCCTCC CTmTCATAC TTITCTCTTT ACTGTTCTCT TACCOOCTTT 50 
MALP YHT FLF T VLL PPF 
WPS LIIL FSL LFS YPLS 
GPP LSY FSLY CSL TPF 

CX3CTCICACT GCACCnXTC CATGCTGCTG TACAACCAGT AGCTOQOCTT 100 
ALT APPP CCC TTS SSPY 
LSL HPL H A A V QPV APL 
RSHC TP S MLL YNQ. LPL 

ACCAAGAGTT TCTATGAAGA. A0GO33CTTC CTGGAAATAT TGATGOCCCA 150 

QEF L.R TRLP GNI DAP 
T KSF YEE RGF LEIL MPH 
PRV SMKN AAS WKY .CPI 

TCATATAQGA G'i'l'iATCTAA GGGAAACTCC AGCTTCACTG GOCACAOOCA 200 
SYRS LSK GNS TFTA HTH 
HIG VYLR ETP PSL PTPI 
I.E FI. GKLH L H C PHP 

TATGQOQQGC AACTGCTATA ACTCTGOCAC TCTTTGCATG CATGGAAATA 250 
MPR NCYN SAT LCM HANT 
CPA T A I TLPL FAC MQI 
YAPQ L L LCH SLHA CKY 

CTCATEATTG GACAGQGAAA ATGATTAATC CTAGTTGTOC TGGAGGACTT 300 

HYW TGK MINP SCP GGL 
LIIG QGK .LI LVVL EDL 
SLL DREN D.S .LS WRTW 

GGAGOCACTG TCTGTTGGAC TTACTTGACC CATACCAGTA TGTCTGATGG 350 
GATV CWT YFT HTSM SDG 
EPL SVGL TSP IPV CLMG 
SHC LLD^LLHP YQY V.W 
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ADC7ICAOCIG TGTAAAATTT AQCAAIACIA TAGACACAAC CAQCIGQCAA 750 

LTC VKF SNTI DTT SSQ 
TSPV N L AIL. T Q P APN 

PHL CKI. QYY RHN QLPM 

TGCATCAGGT QQGflAACACC T0O2ACAO3A ATAGTCIGCC TAQCCTCAGG 800 
CIRW VTP PTR IVCL PSG 
ASG G.HL PHE .SA YPQE 
HQV GNT SHTN SLP TLR 

AATAT.L'iTJ.T GTCIGTGGTA GCICAQOCTA. TCATIGTTIG AATGGCICIT 850 
IFF VCGT SAY HCL NGSS 
YFL SVV PQPI IV. MAL 
NIFC LWY LSL SLFE WLF 

CAGAATCTAT GIGCTIDCTC TCATICITAG TOOOOOCIAT GAOCATCTAC 900 

ESM CFL SFLV PPM TIY 
QNLC ASS HS. CPL. P ST 
RIY VLPL ILS APY DHLH 

ACTGAACAAG ATITATACAA TCA2GTOGTA OCTAAGOGOC ACAACAAAAG 950 
TEQD LYN HVV PKPH NKR 
LNK IYTI MSY LSP TTKE 
. TR FIQ SCRT .AP QQK 

AGTADQCATT CTTDL'l'l'l'lG TTATCAGAGC AGGAGTGCTA. GGCAGACTAG 1000 
VPI LPFV IRA GVL GRLG 
YPF FLL LSEQ EC. A D. 
STHS SFC YQS RSAR QTR 

GTACIGGCAT T3GCAGTATC ACAACCTCEA CTCAGTICTA CTACAAACTA 1050 

TGI GSI TTST QFY YKL 
VLAL AVS QPL LSST TNY 
YWH WQYH NLY SVL LQTI 
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. 10 20 30 40 50 
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TCICAAGAAA TAAATQGTGA CATOSAACAG GTCACTGACT CCCJX33TCPC 1100 
S QEI NGD MEQ VTDS LVT 
LKK M V T WNR S L T PWSP 

SRN KW. HGTG H.L PGH 

CITOCAAGAT CAACTEAACT CCCTAQCAGC AGTAGTOCIT CAAAATCGAA 1150 
LQD QLNS Li A A VVL QNRR 
CKI NLT P.QQ .SF KIE 
LARS T.L PSS SSPS KSK 

GAQCTITAGA Cl'lUClA AGC GCCAAAAGAG GQQGAACCTG TTTAi'l'l'l'IA 1200 

ALD LLT AKRG GTC LFL 
EL.T C.P PKE GEPV YF. 
SFR LANR QKR GNL FIF R 

GGAGAAGAAC GCT3TTATEA T3TTAATCAA TCCAGAATIG TCACIGAGAA 1250 
GEER CYY VNQ SRIV TEK 
EKN AVIM LIN PEL SLRK 
RRT LLL C.SI QNC H.E 

AGTEAAAGAA ATIDGAGATC GAATACAATC TAGAQCAGAG GAGCTIGAAA 1300 
VKE IRDR IQC RAE ELQN 
LKK FEI EYNV EQR SFK 
S.RN SRS NTM S R G ASK 

ACACQGAAQG CTGGGOOCTC CICAGGCAAT GGAT3GGCIG GGTTCIGQOC 1350 

TER WGL LSQW MPW VLP 
TPNA GAS SAN GCPG FSP 
HRT LGPP QPM DAL GSPL 

TICTEAQGAC CICEAGCAQC TC7IAATATIG TEACTOCTCT TIGGACOCIG 1400 
FLGP LAA LIL LLLF GPC 
S.D L.QL .YC YSS LDPV 
LRT SSS SNIV TPL WTL 
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FIG 4 (suite) 
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TATCITTAAC CTOLTIUI'IA AGETIGTCIC TICCAGAATT GAAQCIGTAA 1450 
IFN LLVK FVS SRI EAVK 
SLT SLL SLSL PEL KL. 
YL.P PC. VCL FQN. SCK 

AGC3ACAGAT G3ICITACAA. A1Q3AACCCC A 1481 

LQM VLQ MEP 
SYRW SYK WNP 
ATD GLTN GTP 
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TCAAAATCGA AGAGCTTTAG ACTIGCIAAC CX30CAAAAGA. GQGQGAAaCT 50 
SKSK SFR LAN RQKR G N L 
QNR R A L D L L T AKR GGTC 
KIE EL. TC.P PKE GEP 

(JlTimTl ' lT AGQQGAAGAA T3CIGTEAGT ATGTIAATCA ATCIQGAATC 100 
FIF RGRM LLV C.S IWNH 
LFL GEE CC . Y-VNQ SGI 
VYF. GKN AVS MLIN LES 

ATIACTGAGA AAGTEAAAGA AATTIGAGAT OSAATATAAT GEAGAGCAGA 150 

Y.ES.RNLRSNIM .SR 
ITEKVKE I . D RI.C RAE 
LLR KLKK FEI EYN VEQR 

GGAGCTICAA AACACIGCAC CCTGGQQOCT CCICAGGCAA TOGATGQQCT 200 
GPSK HCT LGP PQPM DAL 
DLQ NTAP WGL LSQ WMPW 
TFK TLH PGAS SAN GCP 

G3ACTCTCQC CTICTEAGGA CCICIAGCAG CEATAATATT TITAOTGCIC 250 
DSP LLRT SSS YNI FTPL 
TLP FLG PLAA I I F LLL 
GLSP S.D L.Q L.YF YSS 

TTTGGAQQCT GTATCTTCAA CTIDCTIGTT AAGTTT3TCT CTTOCAGAAT 300 

WTL YLQ LPC. VCL FQN 
FGPC IFN FLV KFVS SRI 
LDP VSST SLL SLS LPEL 

TGAAG L ' IUIA AAGCXACAAA TAGl'lUl'lCA AA1QGAAQQC CAGATGCAGT 350 
. SCK ATN SSS NGTP DAV 
EAV KLQI VLQ MEP QMQS 
K L SYK F F K WNP RCS 
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FIG 5 (suite) 
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(XA3GACTAA AATCEAQOGT GGAQOQCTGG ACpQGCCTGC TAGACIATOC 400 
HD. NLPW TPG PAC T M L 

MT K IYR GPLD RPA RLC 
P.LK STV D P W TGLL DYA 

TCIGATC7ITA ATGACATIGA AGICAOCXXT CQOGAQGAAA TCICAACTGC 450 

. C . . H . SHPSRGNLNC 
SDVN DIE VTP PEEI STA 
LM L MTLK SPL PRK SQLH 

ACAAQGQCm CIACACTQCA ATTCAGTAGG AAGGAGTTAG AGGAGTTGIC 500 
TTPT T L Q FSR KQLE QLS 
QPL LHSN SVG SS. SSCQ 
NPY YTP IQ.E A V R AVV 

AQOCAAOCIC GCGAACAGTA CTIQ3GTTIT GCIGTIGAGA GGGIGGACIG 550 
ANL PNST WVF L.LR GWTE 
PTS PTV LGFS C.E GGL 
SQPP QQY LGF PVER VD. 

AGAGACAGGA CEAGCIGGAT TIGCTAGGCT GACTAAGAAT QGCNAAQOCT 600 

RQD L D FLG. LRI PKP 

RDRT S W I S.A D.ES XSL 
ETG LAGF PRL TKN PXAX 

ANCIGQGAAG GTSAQGGCAT GCATCTTIAA ACATC3GQGCT TOCAACTTAG 650 
XWEG DRI H L TWGL QLS 

XGK VTAS IFK HGA CNLA 
LGR .PH PSLN MGL AT. 

CICACAQOOG ACCAATCAGA GAGCIGACTA AAAT3CTAAT CAGQCAAAAA 700 
SHP TNQR AH. NAN QAKT 
HTR PIR ELTK M L I RQK 
LTPD QSE SSL KC.S GKN 
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CAGGAQGTAA AGCAATAGQC AATCATCTAT TG3CIGAGAG CACAGOGQGA 750 

GGK A I A N H L L PES TAG 
QEVK Q.P I I Y CLRA QRE 
RR. SNSQ SSI A . E HSGK 

AGGACAAQGA TIQ9GATATA AACTCAQQCA TTCAAQQCAG CAACAGCAAC 800 
RTRI G I . TQA FKPA TAT 
GQG LGYK LRH S SQ QQQP 
DKD W D I NSGI Q A S NSN 

OCXXTITGQG TODOCIOXA T1GTATGQ3A GCTCTGITTT CACTCTATIT 850 
PFG SPPI VWE LCF HSIS 
P L G PLP LYGS SVF TLF 
P L W V PSH CMG ALFS LYF 

GACTCTATTA AATCATGCAA CTGCACTCTT CIGGTCOGIG TTITTTATGG 900 

L Y . I M Q LHSS GPC F L W 
HSIK SCN CTL LVRV FYG 
TLL NHAT ALF WSV FFMA 

CIGAAGCIGA GJl'lTlUl'lC QOCATGCAOC ACTGCTGTTT QGCACCGTCA 950 
LKLS FCS PST TAVC HRH 
SS. AFVR HPP LLF ATVT 
QAE LLF AIHH CCL PPS 

CAGAOOCQCT GCIGACTTOC ATG3CTTTQG ATOCAGCAGA GIGTCCACTG 1000 
RPA ADFH PFG SSR VSTV 
DPL LTS IPLD PAE CPL 
QTRC .LP SLW IQQS VHC 

TGCTOCTGAT OCAGOGAGGT AGQCATTGQC ACTOCOGATC AGOCTAAAGG 1050 

LLI QRG THCH SRS G.R 
CS.S SEV PIA TPDQ AKG 
APD PARY PLP LPI RLKA 
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FIG 5 (suite) 



10 20 30 40 50 

123456789 0 1234567890 1234567890 1234567890 1234567890 

CTIGOCATTG TIGCTGCATG GCTAAGIGCC TG3GTTTGTC CTAATAGAAC 1100 
LA I V PAW LSA WVCP NRT 
LPL FLHG .VP GFV LIEL 
CHC SCM AKCL GLS . .N 

TGAACACTGG TCACTGGGTT GGAIGGTICT CTIDCATCAC CCACG3CTTC 1150 
EHW SLGS MVL FHD PRLL 
N TG HWV PWFS SMT HGF 
. TLV TGF HGS LP.P TAS 

TAATAGAGCT ATAACACTCA GQ32A1GGQC CAAGATICCA. TIXXTIGGTA 1200 

IEL .HS PHGP RFH SLV 
- . SY NTH RMA QDSI PWY 
NRA ITLT AWP KIP FLGI 

TCIGIGAGGC CAAGAAOOOC AGGTCAGAGA ANGIGAGGCT TOOGAOGATT 1250 
SVRP RTP GQR X.GL PPF 
L.G QEPQ VRE X E A CHHL 
CEA KNP RSEX VRL ATI 

TGGGAAGIGG OQCACTGCCA TTTIGGTAGC G3CCCACCAC CATCTIGGGA 1300 
GKW PTAI LVA AHH HLGS 
GSG PLP FW.R PTT ILG 
WEVA HCH FGS GPPP SWE 



GCIGTGGGAG CAAQGATQQC C3CAGTAACA 

CGS KDP PVT 
A V G A RIP Q. 
LWE QGSP SN 



1329 
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CCTAGAACGr ATICIGGPGA ATTOQGACCA ATGTTGACACT CAGACQCTAA 50 
PRTY SGE LGP M.HS-DAK 
LER ILEN WDQ CD T QTLR 
. NV FWR IGTN VTL RR. 

GAAAGAAAGG ATTTATATIC TTCTGCPGTA OOQOCIGQQC ACAATAIOCT 100 
KET I Y I L L Q Y RLA TISS 
KKR FIF FCST AWP QYP 
ERND LYS SAV PPGH NIL. 

CTICAAGQGA GAGAAAOCIG GCTICCTGAG GGAAGTATAA ATTATAACAT ISO 

SRE RNL AS.G KYK L.H 
LQGR ETW L P E GSIN YNI 
FKG EKPG FLR EV. I I T S 

CATCITACAG CEAGAQCICT TCTG7EAGAAA GGAG33CAAA TGGAGIGAAG 200 
HLTA RPL L.K GGQM E.S 
I L Q L D L F CRK EGK WSEV 
SYS .TS SVER RAN GVK 

TOCCATATCT GGAAACTITC Tl'l'lCATTAA GAGACAACIC ACAATTATCT 250 
AIC A N F L FIK RQL TIM. 
PYV QTF F S L R DNS Q L C 
CHMC K L S FH. ETTH NYV 

AAAAAGIGIG GTITEATQC3C3C TACAGGAAGC OCICAGAGIC CACCTCXXTTA 300 

KVW FMP YRKP S ES TSL 
KKCG LCP TGS PQSP PPY 
KSV V Y A L QEA L R V H L P T 

OC3Cr3G0GTC COCIXXXXXjA CIULTIOCTC AACTAATAAG GAOXaCCTT 350 
PQRP LPD SFL N .G PPF 

PSV PSPT PSS TNK DPPL 
PAS PPR L L P Q L I R TPL 

TASOOCAAAC G3TOCAAAAG GZKMBGACA AAGQGCTAAA CAATCAACCA 400 
NPN GPKG DRQ RGK Q.TK 
TQT VQK EIDK GVN NEP 
P K R SKR R.T KG.T MNQ 

AAGACT3CXA ATAl'IQXDG ATEATOOOOC CTOCAAGCAG IGAGAGGAGG 450 

ECQ YSP IMPP PSS ERR 
KSAN IPR LCP LQAV RGG 
RVP IFPD YAP SKQ E E E 

AGAA3TOG3C GCAGOCAGAG TOCOCTACC TlTl'lCICIC ICAGACTTAA 500 
RIRP SQS AC.T FFSL RLK 
EFG PARV PVP FSL SDLK 
NSA QPE CLYL FLS Q T 
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AGCAAATTAA AATAGAOCTA GCTAAATTCT CPGATAPCCC TCACQQCTAT 550 
AN. NRPR . I L R.P --RL Y 
QIK IDL GKFS DNP DGY 
SKLK . T . V N S QITL T A I 

ATIGATCITT TACAAGGGTT AQGACAATOC TTIGAICIGA 600 

. CF TRV R T I L . S D MER 
IDVL QGL G QS FDLT WRD 
LMF YKG. DNP LI. HGEI 

TATAATCTTA CTACTAAATC AGACACTAAC CQCAAATCAG A3AACT3003 650 
Y N V T TKS D T N PK.E KCR 
I M L LLNQ T L T PNE RSAA 
. C Y Y.I RH.P QMR EVP 

CICTAACTGC AOOCXIGA3AG TTIQQCGATC TTK3CTATCT CAGITIAGQQC 700 
CNC SPRV WRS LVS QSGQ 
VTA ARE FGDL WYL SQA 
L.LQ PES LAI FGIS.VRP 

AACAATAQ3A TCACAACAGA QGAAAGAACA ACICOCACAG GCCA3C2GQC 750 

Q.D DNR GKNN SHR PAG 
N N R M TTE ER T TPTG QQA 
TIG QQR KEQ LPQ ASRQ 

H3TTOCCP&T GTA3AOQCIC ATTOQG&CAC AGAATCfcGAA CATO3A3A3T 800 
SSQC RPS L G H RIRT WRL 
VPS VDPH WDT ESE HGDW 
FPV T Li IGTQ N Q N MEI 

GGIGOCACAA ACA1T1ULTA ACnQOGflQC TAGAMGACT GBGGAAAftCT 850 
VPQ TFAN L R A RRT EEN. 
CHK HLL T C V L EGL RKT 
GATN IC. LAC K D GKL 

CTATCAATTA CICAATCATO TCX^ACEATAA QCM03AAA 900 
EEA YEL LNDV HYN TG K 
RKKP MNY SMM STIT QGK 
GRS L.IT Q.C PL. HRER 

OGAAGAAAAT CTmi'lUL'lT TICIQGAC?^ ACIAM3GA3 QCAITGZOGA 950 
GRKS YCF SGQ TKGG IEE 
EEN LTAF LDR LRE ALRK 
KKI LLL FWTD .GR H.G 

PGCATPCCTC OTICTCAOCT GftCICTATTC AMQOCAACT AATCTEAAA3 1000 
AYL PVT. LY. RPT NLKG 
HTS LSP DSIE GQL ILK 
SI PP CHL TLL KAN. S.R 
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GATAPCTTTA TCPCTTCPGTC AGCTOCfcGAC ATTAGAAAAA ACTCCAAAA3 1050 

.VYHSVSCRH .KKLQK 
DKFI TQS A A D IRK N FKS 
I S L S L, S Q LQT L E K TSKV 

TCTGCCTTPG GOOOQGAGCA GAACTTAGAA AOGCEATITA ACITOGCATC 1100 
SALG PEQ NLE T L F N LAS 
LP. ARSR T.K PYL TWHP 
CLR PGA ELRN PI. LGI 

CICA GT1T1T TA!EAATAGAG ATCAGGAGGA GCAG0O3AAA OGQGACAAAC 1150 
SVF YNRD QEE QAK RDKR 
QFF I I E IRRS RRN GTN 
LSF L . R SGG AGET GQT 

Q3GATAAAAA AAAASGQQGG GGiaCACTAC TTTACTCATC GCmiCSGQC 1200 

DKKKRGGPLL .SWPSG 
GIKK KGG VHY FSHG PQA 
G.K KKGG STT LVM ALRQ 

AMCAGACTT TOGAGQCTCT QCAAAAGGGA AAAQCTQQ3C AAATCAAA1G 1250 
KQTL EAL QKG KAGQ I K C 
SRL WRLC KRE KLG KSNA 
ADF GGS AKGK SWA NQM 

(XTCAATAG3G CIQQCTTOCA GIGOGGICTA CAM3ACACT TEAAAAAAGA 1300 
LIG LASS AVY KDT LKKI 
. . G WLP VRST RTL .KR 
PNRA GFQ CGL QGHF KKD 

TEATOCAAjT AGAAATAA3C 03CXXCCTTG TOCATOOaOC TTAOGTCA^G 1350 

IQV EIS RPLV HAP YVK 
LSK. K.A APL SMPL TSR 
YPS RNKP PPC PCP LRQG 

QGAATCACIG GA2GQ00C2C GATCAAGATA CICIGfiGICA 1400 

GITG RPT APG DEDT LSQ 
ESL EGPL PQG MKI L.VR 
NHW KAH CPRG R Y S E S 

GAAGOCATEA AOGAGATCAT CCPOZP&CPG GACIGfiGQGfT 0000303003 1450 
KPL TR.S SSR TEG ARGE 
SH. PDD PAAG LRV PGA 
EAIN Q M I QQQ D.GC PGR 

AGOOOCAGCT CAIOXMCA COOICACAGA GOOOCDOSm 'lUlTIUAOCA 1500 

RQP MPS PSQS PGY V.P 
SASP CHH PHR APGM FDH 
APA HAIT LTE PRV CLTI 
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ATOGGCAGCA GOCATCATCA TCATCATCAC AGCAQaGGOC TGGTGCCGCG 50 
MGSS HHH HHH SSGL VPR 

CGQCBGOCAT ATOGCTAGCA IGACIGGIGG ACAGCAAATG GGTOQGATOC 100 
GSH MASM TGG QQM GRIL 

TAGAAQGrEAT TCIGGAGAAT TOQGAGGAAT GTGACACTCA GAGQCTAAGA 150 
ERI LEN WDQC DTQ TLR 

AAGAAAOGAT TTATATTCIT CTGC^GTftOC GQCIGGGCAC AATATCCTCT 200 
KKRF IFF CST AWPQ Y P L 

TCAAGGGAGA GAAAQCTO3C TIDCIGAGGG AAGEATAAAT TATAACATCA 250 
QGR ETWL PEG SIN YNII 

TUITACAGCT AGACCICITC TCTAGAAAGG AGQQCAAATG GAGIGAAGIG 300 
LQL DLF CRKE GKW SEV 

CCATATGTGC AAACITIUIT TTCATEAAGA GACAACTCAC AATTA1GTAA 350 
PYVQ TFF SLR DNSQ LCK 

AAAGIGIQGT TTATC50QCTA CAGGAAQGQC TCAGAGTCCA OGTOXTAaC 400 
KCG LCPT GSP QSP PPYP 

CCAGO3TO0C CIOXDSACr CCTIDCTCAA CTAATAAGGA GOOOOOYTm. 450 
SVP SPT PSST NKD PPL 

AO0CAAAO3S TCCAAAAGGA GATAGACAAA GGGGEAAACA ATGAACCAAA 500 
TQTV QKE IDK GVNN EPK 

GAGTOCCAAT ATTOQOOSAT TATOCOCOCT GCAAGGAGTC AGAQGAGGAG 550 
SAN IPRL CPL Q A V RG GE 

AATTOGGOCC AGOCAGAGIG GCIGTAOCTT TITCTCICIC AGACTEAAAG 600 
FGP A R V PVPF SLS DLK 

CAAATEAAAA TAGAOCTAGG TAAATTCTCA GATAAGOCTG ACQ3C7EAIAT 650 
QIKI DLG KFS DNPD GYI 

"IGAIGITITA CAAGGGTEAG GACAATOCTT TGA1UIGACA TOGAGAGATA 700 
DVL QGLG QSF DLT WRDI 

TAATOTEACT ACTAAATCAG ACACEAAOOC CAAATGAGAG AAGIGOCGCT 750 
MLL LNQ TLTP NER S A A 

GTAACTGCAG OXGAGAGTT IGGGEATCTT TO3CATCICA GTCAGGOCAA 800 
V T A A REF GDL WYLS QAN 
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CAATAGGATC ACAACAGAGG AAAGAACAAC TOCCACAGGC CAGCAQGCAG 850 
NRM TTEE RTT PT G QQAV 

TKXX^GIGT AGAOXTCAT TQQGACACAG AATCAGAACA TOGAGATIGG 900 
PSV DP H WDTE SEH GDW 

TOOCACAAAC ATITGCTAAC TIGDGTCJCTA GAAQGACIGA GGAAAACTAG 950 
CHKH LLT CVL EGLR KTR 

GAAGAAGQCT ATGAATIACT CAATCATGIC CACTATAACA CAGGGAAAQG 1000 
KKP MNYS MMS TIT QGKE 

AAGAAAATCT TACT3CTITT CIGGACAGAC TAAQQGAGGC ATTGAGGAAG 1050 
ENL TAF LDRL REA LRK 

C^TAQCTOOC TCTCAOCTGA CTCTATIGAA QQCCAACTAA TCTIAAAGGA 1100 
HTSL SPD SIE G Q Li I LKD 

TAAGTTEATC ACICAGTCAG CTGCAGACAT TAGAAAAAAC TTCAAAAGIC 1150 
KFI TQSA A D I RKN FKSL 

TOOCTAAGCT TGCGQQOQCA CTOGAGCAQC AQCAOCAOCA CCACIGAGAT 1200 
PKL AAA LEHH HH H H.D 

OCGGCIGCTA ACAAAGQQCG AAAQGAAQCT GAGITOGCIN GIGGCNA 1247 
PAAN KAR KEA E L. A X G 
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ATOGCTAQCA TCACTOOTOG ACAGCAAATG GGTO33ATCC TAGAAOGTAT 50 
M ASM TGG QQM G R I L ERI 

TCIQGAGAAT TQQGAOCAAT GTTGACACTCA GAGQCTAAGA AAGAAAOGAT 100 
LEN WDQ C DTQ TLR KKRF 

TTATATTC3T CIGC^GTACC GCCIGQOCAC AATATCCIUT TCAAGQGAGA 150 
IFF CST AWPQ YPL QGR 

GAAAOCTQQC TTCCIGAGGG AAGTATAAAT TATAACATCA TCTIACAGCT 200 
ETWL PEG SI N YNII LQL 

AGACCICTIC TCIAGAAAGG AQGGCAAATG GAGIGAAGIG CXATAIGIGC 250 
DLF CRKE GKW SEV PYVQ 

AAACTITCIT TICATTAAGA GACAACICAC AATTATCIAA AAAGIGIQGT 300 
TFF SLR DNSQ LCK KCG 

TTATGOQCEA CAQGAAGOOC TCAGAGTOCA OCIUXTAOC 350 
LCPT GSP QSP PPYP SVP 

CIOQOOGACT CTTIOCTCAA CEAATAAGGA QOOCOCTITIA ADC3CAAAQQG 400 
SPT PSST NKD PPL TQTV 

TCX2AAAAGGA GATAGACAAA GQQGEAAACA ATCAAOCAAA GAGIXXCAAT 450 
QKE IDK GVNN EPK SAN 

ATICOOOGAT TATCOOQQCT CCAAGC2CTG AGAGGAGGAG AATIOQGQOC 500 
IPRL CPL Q A V RGGE FGP 

AGOCAGAGIG QCTIGIAOCTT TITC7ICICIC AGACITAAAG CAAATEAAAA 550 
A R V PVPF SLS DLK QIKI 

TAGAOCTAGG TAAATIUICA GATAADOCIG AOGQCEATAT TGATCTETTEA 600 
DLG KFS DNPD GYI DVL 

CAAGQGTEAG GACAAICCTT TCATCIGACA TOG£GAGATA TAATCTEACT 650 
QGLG QSF DLT WRDI MLL 

ACEAAATCAG ACACEAAOOC CAAATCAGAG AAjIGQCGCT GTAACIGCAG 700 
LNQ TLTP NER SAA V T A A 

O0CGAGAGTT TOGOGATCTT TOCTAICICA GICAGGOCAA CAATAGGATC 750 
REF GDL WYLS Q A N NRM 

ACAACAGAGG AAAGAACAAC TOOCACAGGC CAGCAGQCAG TICOCAGIGT 800 
TTEE RTT PTG QQAV PSV 
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AGAQOCTCAT TO3GACACAG AATCAGAACA TQGAGATIQG TOCOO^AAC 850 
DPH WDTE SEH GD W CHKH 

ATITQCTAAC TIGCOIQCTA GAAGGACIGA GGAAAACTAG GAAGAAGQCT 900 
LLT CVL EGLR KTR KKP 

ATCAATTACT CAATGATCTC CACTATAACA CAGGGAAAQG AAGAAAATCT 950 
MNYS MMS TIT QGKE ENL 

TRCIQCT1T1' CIQGACAGAC TAAGQGAQQC ATTGAGGAAG (^TACCTOOC 1000 
TAF LDRL REA LRK HTSL 

TCICAOCTCA CTCTATTCAA GGQCAACTAA TCTTAAAQGA TAAGITTATC 1050 
SPD SIE GQLI L. K. D K F I 

ACICAGICAG CIGGAGACAT TAGAAAAAAC TTCAAAAGIC TOQCTAAGCT 1100 
TQSA ADI RKN FKSL PKL 

TGOGGCCGCA CTCGAQCAOC AOCAGCAOCA (XACIGAGAT COGQCTOCTA 1150 
AAA LEHH HHH H.D PAAN 



ACAAAGOOOG AAAGGAAGCT GAGITGQCTC GIGGCA 
KAR KEA ELAG G 
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TCICCQCIGT GCIUCTGATC CAGCACAQGC GCXXATTGGC TCTCG2AATT 50 
CPLC S . S STG AHCL SQL 
VRC APDP AQA PIA SPNW 
SAV LLI QHRR PLP LPI 

GQGCTAAAGG CTTGOCATTG TT0CT3CACA GCTAAGTGQC TGGGTICATC 100 
G.R L A I V PAQ LSA WVHP 
AKG Li P Li FLHS .VP GFI 
GLKA CHC SCT AKCL GSS 

CTAATQGAGC TGAACACTAG TCACTGGGTT CCAOQGTICr CITCCATGAC 150 

NRA EH. SLGS TVL FHD 
LIEL NTS HWV PRFS SMT 
.SS TLV TGF HGS LP.P 

OIAIGGCITC TAATAGAGCT ATAACACTCA CIGCATGGIC CAAGA1TGCA 200 
PWLL IEL .HS L HGP RFH 
HGF SY NTH CMV QDSI 

M AS NRA ITLT AWS KIP 

TTGCITGGAA TCOGTGAGAC CAAGAACGQC AGGICAGAGA ACACAAGGCT 250 
SLE SVRP RTP GQR TQGL 
PWN P.D QEPQ VRE HKA 
FLGI RET KNP RSEN TRL 

TGCCACCATG TTGGAAGGAG OXACXAOCA TTTTGGAAGC AGOQGQOCAC 300 

PPC WKQ PTTI LEA ARH 
CHHV GSS PPP FWKQ PAT 
ATM LEAA HHH FGS SPPL 

TATCTTGQGA GCTCIGGGAG CAAGGAOOOC AQGTAACAAT TTGGIGAGGA 350 
YLGS SGS KDP R.QF GDH 
ILG ALGA RTP GNN LVTT 
SWE LWE QGPQ VTI W.P 

CGAAGGGACC TGAATOOGCA AGCATGAAGG GATCIOGAAA GCAATTQGAA 400 
E G T .IRN HEG ISK AIGN 
KGP .ESA TMKG SPK QLE 
RRDL NPQ P.R DLQS NWK 
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AT3TICCTCC CAAQQCAAAA ATGCXXJCTAA GATGEATICT GGAGAATIQG 450 

VPP KAK MPL R CIL ENW 
MFLP RQK CP. DVFW RIG 
CSS QGKN APK MYS GELG 

GACCAATTIG ACCCICAGAC AGTAAGAAAA AAATGACTTA TATICTTCTG 500 
DQFD PQ T VRK K.LI FFC 
TNL TLRQ -EK N D L Y SSA 
PI. PSD SKKK MTY ILL 

CAGTACCGOC CIGGCCACGA TATCCICTIC AAQQQQGAGA AACCIGGCCT 550 
STA LATI SSS RGR NLAS 
VPP WPR YPLQ G GE TWP 
QYRP GHD ILF KGEK PGL 

CCIGAGQGAA GTATAAATTA TAACACCATC TIACAGCTAG ACCIGTTTIG 600 

.GK YKL H H L TAR PVL 

PEGS INY NTI LQLD LFC 
LRE V.II TPS YS. TCFV 

TAGAAAAGGA QGCAAATGGA GIGAAGIGCC ATATITACAA ACTTICITIT 650 
K R R QME . S A IFTN FLF 
RKG GKWS EVP YLQ TFFS 
EKE ANG VKCH IYK LSF 

CAITAAAAGA CAACTCGCAA TIATGTTAAC AGTGTGATTT GIGTICCTAC 700 
IKR QLAI MLT V.F VFLH 
LKD NSQ LC.Q CDL CSY 
H.KT TRN YVN SVIC VPT 

ACQGAAGQCC TCAGATICTA CICCCCAGCC G03GCATCrC CCCIGAATCC 750 
GSPQILLPTPGISPES 

TEAL RFY SPP PASP LNP 
RKP SDST PHP RHL P. IP 



CICCCCAACT TATT 
L P N L 
S P T- Y 
P Q L I 
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TGTCCGCTGT GCTOCIGA.TC CAQCAGAQQC GCCCATTGCC TCT3QCAATT 50 
CPLC S.S STG A H C L SQL 
V R C A P D P A Q A PIA SPNW 
SAV L L I QHRR PLP LPI 

GGQCTAAAQ3 CTTGOCATTG TTtCTOCACA GCTAAGTGGC TGQGTTCATC 100 
G.R LAIV PAQ LSA WVHP 
A K G L P L F L H S_ . VP G F I 
G L K A CHC SCT AKCL GSS 

CTAATCGAGC TGAACACTAG TCACIGQGTT GCAGGGTTCT CTTCCATGAC 150 
NRA EH. SLGS TVL FHD 

LIEL NTS HWV P RFS SMT 
.SS T L V TGF HGS LP.P 

GCATGGCITC TAATAGAQCT ATAACACICA CT3CATGGTC CAAGATICCA 200 
PWLL I E L .HS LHGP RFH 
HGF . .SY NTH CMV QDSI 
MAS NRA ITL T AWS KIP 

TTGCTIGGAA TCQGTGAGAC CAAGAACGQC AGGTCAGAGA ACACAAGGCT 250 
SLE SVRP RTP GQR TQGL 
PWN P.D QEPQ VRE HKA 
FLGI RET KNP RSEN TRL 

TOCCAOCATG TTGGAAGCAG CXDGAOCAGCA TITTGGAAGC GGQ0O3GCAC 300 

PPC WKQ PTTI LEA ARH 
CHHV GSS PPP FWKR PAT 
ATM LEAA HHH FGS GPPL 

TATCITGQGA QCTCIGQGAG CAAGGAOGQC CAGGTAACAA TTTGGTGAOC 350 
YLGS SGS KDP QVTI W.P 
ILG ALGA RTP R.Q FGDH 
SWE LWE QGPP GNN LVT 

AGGAAGQGAC CTGAATCOGC AAGCATGAAG GGATCICCAA AGCAATIGGA 400 
RRD LNPQ P.R D LQ SNWK 
EGT .IR NHEG I SK AIG 
TKGP ESA TMK GSPK QLE 
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AATGITCCTC CCAAG9CAAA AATGCCCCEA. AGAIGTATTC TOGAGAATTG 450 

CSS QGK NAPK M Y S GEL 
NVPP KAK MPL R C I L ENW 
M F L PRQK CP. DVF WRIG 

GGAOCAATCT GACCCTCAGA CAGIAAGAAA AAAAATGACT TATAT1LT1C 500 
GPI. PSD SK K KNDL YSS 
DQS DPQT. VRK KMT YILL 
T N L T L R Q.EK K . L IFF 

TGCAGTACCG CCIGGCCACG GATATCCICT TCAAGGGGGA GAAACCIGQC 550 
AVP PGHG YPL QGG ETWP 
QYR LAT DI L F K G E K P G 
CSTA WP R ISS SRGR NLA 

CICCTGAGGG AAGTATAAAT TATAACACCA TCTTACAGCT AGACCTGTTT 600 

PEG SIN YNTI LQL DLF 
LLRE V.I ITP SYS. TCF 
S. G KYKL .HH LTA RPVL 

TGTAGAAAAG GAQGCAAATG GAGIGAAGIG CGATATITAC AAACTITCTT 650 
CRKG GKW SEV PYLQ TFF 
VEK EANG VKC HIY KLSF 
.KR RQM E.SA IFT NFL 

TICATIAAAA GACAACTCGC AATEATCEAA ACAGIGIGAT TIGIGICCTA 700 
SLK DNSQ LCK QCD LCPT 
H.K TTR NYVN SVI CVL 
FIKR QLA IM. TV.F VSY 

CAQGAAGCCC TCAGATCTCAC CTCCCTACOC CQ3CmCTCC CIGACICCTT 750 

GSP QIY LPTP ASP L L 

QEAL RST SLP RHLP DSF 
RKP SDLP PYP GIS LTPS 

CCCCAACIAA TAAGGACCCA CTICAGCCCA AACAGTCCAA AAGGACATAG 800 
PQLI RTH FSP NSPK GH 
PN. G P T SAQ TVQ KDI 

PTN KDP LQPK QSK RT. 



WO 99/02666 



25 / 32 
FIG 11 



PCT/FR98/01460 



10 20 30 40 50 

1234567890 1334567890 1234567890 12345678 90 1234567890 

GGCATIGATA GCACCCATCA GATO30CAAA TCATIATTTA CTGGACCAGG 50 
GIDS THQ MAK SLFT GPG 
ALI APIR WPN H Y Li LDQA 
H. . HPS DGQI IIY WTR 

OJl ' l ' l ' l CAAA ACTATCAAGC AGATAGQGCC CXJIGAAGCAT GOCAAAGAAA 100 
LFK TIKQ IGP VKH AKEI 
FSK LSS R.GP .SM PKK 
PFQ. N YQA DRA REAC QRN 

TAATCCCCIG CCITATCQCC AlUITCCTIC AGGAGAACAA AGAACAGGQC 150 
IPC LIA MFLQ ENK EQA 
SPA LSP CSF RRTK NRP 
NP L PYRH VPS GEQ RTGH 

ATIACCCAGG QGAAGACTQG CAACTAGATT TIACCCACAT GGCCAAAIGT 200 
ITQG KTG N.I LPTW PNV 
LPR GRLA TRF YPH GQMS 
YPG EDW QLDF THM AKC 

CAGGGATTIC AGCATCEACT AGICTGQQCA GATACTTTCA CIQGTIQQGT 250 
RDF SIY. SGQ ILS LVGW 
GIS AST SLGR YFH WLG 
QGFQ HLL VWA DTFT GWV 

QGaGTCTICT CCTIGTAGGA CAGAAAAGAC CCAAGAGGTA ATAAAGGCAC 300 

S LLLVGQKRPKR. .RH 
GVFS L.D RKD PRGN KGT 
ESS PCRT EKT QEV IKAL 

TAATGAAATA ATICCCAGAT TTGGACTIOC CCCAGGATTA CAG33TGACA 350 
.NN SQI WTS PRIT G.Q 
NEI IPRF GLP PGL QGDN 
MK. FPD LDFP QDY RVT 
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ATOOQOOCGC TTICAAGGCT GCAGTAACCC AGGGAGTATC CJCAQGIG3TA 400 
WPR FQGC SNP GSI PGVR 
GPA FKA AVTQ GVS QVL 
MAPL SRL Q.P REYP RC. 

GGCATACAAT ATCACTTACA CEGIGQCIGG AQGCCACAAT CC1CCAGAAA 450 

HTI SLT LCLE ATI LQK 
GIQY HLH CAW RPQS SRK 
AYN ITYT V PG GHN PPEK 

AGTCAAGAAA ATGAATGAAA CACICAAAGA TCTAAAAAAG CTAACCCAAG 500 
SQEN E.N TQR SKKA NPR 
V K" K MNET LKD LKK LTQE 
SRK .MKHSKI . K S . P K 

AAACOCACAT TGCATGACCT GTICIGTIGC CTATAACCTT ACTAAGAATC 550 
NPH CMTC S. V A YNL TKNP 
THI A.P VLLP ITL L R I 
KPTL HDL FCC L.PY . E S 

CATAACTATC CCGCAAAAAG CAQGACTTAG CQCATACGAG ATGCTATATG 600 

. L S PKK QDLA HTR CYM 
HNYP PKS RT. PIRD AIW 
ITI PQKA GLS PYE MLYG 

GA1GGCCTTT GCTAACCAAT GAOCTIGTGC TTGACTGAGA AAIGGCCAAC 650 
DGLS -PM TLC LTEK WPT 
MAF PNQ. PCA L R NGQL 

WPF LTN D L V L D.E MAN 

TTAGTIGCAG ACATCACCTC CTTAGCCAAA TAICAACAAG TICTEAAAAC 700 
. L Q TSPP .PN INK FLKH 
SCR HHL LSQI STS S.N 
LVAD ITS LAK YQQV LKT 
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ATCACAQQGA AOCIGTOOQC GAGAGGAQGG AAAQGAACTA. TTOCACOCIG 750 

HRE PVP ERRE RNY STL 
ITGN LSP RGG KGTI PPW 
SQG TCPR EEG KEL FHPG 

GTGACAIG 758 
V T 
. H 
D M 
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1 

LISTE DE SEQUENCES 



(2) INFORMATIONS POUR LA SEQ ID NO: 68: 

(i) C ARACTER I ST I QUE S DE LA SEQUENCE : ' 
5 (A) LONGUEUR: 34 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS : simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

10 (xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 68: 

GACTCGCTGC AGATCGATTT TTTTTTTTTT TTTT 34 



(2) INFORMATIONS POUR LA SEQ ID NO: 69: 

(i) CARACTERISTIQUES DE LA SEQUENCE: 
15 (A) LONGUEUR: 30 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

2 0 (xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 69: 

GCCATCAAGC CACCCAAGAA CTCTTAACTT 30 



(2) INFORMATIONS POUR LA SEQ ID NO: 70: 

(i) CARACTERISTIQUES DE LA SEQUENCE: 
25 (A) LONGUEUR: 30 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

3 0 (xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 70: 

CCAATAGCCA GACCATTATA TACACTAATT 30 

(2) INFORMATIONS POUR LA SEQ ID NO: 112: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 
35 ( A ) LONGUEUR: 310 paires de bases 

(B) TYPE: nucleotide 
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(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

(xi) DESCRIPTION DE LA SEQUENCE : SEQ ID NO: 112: 
5 GCTTATAGAA GGACCCCTAG TATGGGGTAA TCCCCTCTGG GAAACCAAGC CCCAGTACTC 60 
AGCAGGAAAA AT AG AAT AG G AAACCTCACA AGGACATACT TTCCTCCCCT CCAGATGGCT 120 
AGCCACTGAG GAAGGAAAAA TACTTTCACC TGCAGCTAAC CAACAGAAAT TACTTAAAAC 180 
CCTTCACCAA ACCTTCCACT TAGGCATTGA TAGCACCCAT CAGATGGCCA AATTATTATT 240 
TACTGGACCA GGCCTTTTCA AAACTATCAA GAAGATAGTC AGGGGCTGTG AAGTGTGCCA 300 
1 0 AAG AAATAAT 310 

(2) INFORMATIONS POUR LA SEQ ID NO: 113: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 
(A) LONGUEUR: 103 acides aminees 
15 (B) TYPE: acide aminee 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: peptide 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 113: 
20 Leu lie Glu Gly Pro Leu Val Trp Gly Asn Pro Leu Trp Glu Thr Lys 

15 10 15 

Pro Gin Tyr Ser Ala Gly Lys lie Glu Xaa Glu Thr Ser Gin Gly His 

20 25 30 

Thr Phe Leu Pro Ser Arg Trp Leu Ala Thr Glu Glu Gly Lys lie Leu 
25 35 40 45 

Ser Pro Ala Ala Asn Gin Gin Lys Leu Leu Lys Thr Leu His Gin Thr 

50 55 60 

Phe His Leu Gly lie Asp Ser Thr His Gin Met Ala Lys Leu Leu Phe 
65 70 75 80 

30 Thr Gly Pro Gly Leu Phe Lys Thr lie Lys Lys lie Val Arg Gly Cys 

85 90 95 

Glu Val Cys Gin Arg Asn Asn 
100 

35 (2) INFORMATIONS POUR LA SEQ ID NO: 114: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 
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(A) LONGUEUR: 635 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
5 (ii) TYPE DE MOLECULE: ADNc 

(xl) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 114: 



CCCTGTATCT 


TTAACCTCCT 


TGTTAAGTTT 


GTCTCTTCCA 


GAATCAAAAC 


TGTAAAACTA 


60 


CAAATTGTTC 


TTCAAATGGA 


GCACCAGATG 


GAGTCCATGA 


CTAAGATCCA 


CCGTGGACCC 


120 


CTGGACCGGC 


CTGCTAGCCC 


ATGCTCCGAT 


GTTAATGACA 


TTGAAGGCAC 


CCCTCCCGAG 


180 


GAAATCTCAA 


CTGCACAACC 


CCTACTATGC 


CCCAATTCAG 


CGGGAAGCAG 


TTAGAGCGGT 


240 


CATCAGCCAA 


CCTCCCCAAC 


AGCACTTGGG 


TTTTCCTGTT 


GAGAGGGGGG 


ACTGAGAGAC 


300 


AGGACTAGCT 


GGATTTCCTA 


GGCCAACGAA 


GAATCCCTAA 


GCCTAGCTGG 


GAAGGTGACT 


360 


GCATCCACCT 


CTAAACATGG 


GGCTTGCAAC 


TTAGCTCACA 


CCCGACCAAT 


CAGAGAGCTC 


420 


ACT AAAATG C 


TAATTAGGCA 


AAAATAGGAG 


GTAAAGAAAT 


AGCCAATCAT 


CTATTG CCTG 


480 


AGAGCACAGC 


GGGAGGGACA 


AGGATCGGGA 


TATAAACCCA 


GGCATTCGAG 


CCGGCAACGG 


540 


CAACCCCCTT 


TGGGTCCCCT 


CCCTTTGTAT 


GGGCGCTCTG 


TTTTCACTCT 


ATTTCACTCT 


600 


ATTAAATCTT 


GCAACTGAAA 


AAAAAAAAAA 


AAAAA 






635 



(2) INFORMATIONS POUR LA SEQ ID NO: 115: 
20 (i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 77 acides aminees 

(B) TYPE: acide aminee 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
25 (ii) TYPE DE MOLECULE: peptide 

<xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 115: 
Pro Cys lie Phe Asn Leu Leu Val Lys Phe Val Ser Ser Arg lie Lys 
15 10 15 

Thr Val Lys Leu Gin lie Val Leu Gin Met Glu His Gin Met Glu Ser 
30 20 25 30 

Met Thr Lys lie His Arg Gly Pro Leu Asp Arg Pro Ala Ser Pro Cys 

35 40 45 

Ser Asp Val Asn Asp lie Glu Gly Thr Pro Pro Glu Glu lie Ser Thr 
50 55 60 

35 Ala Gin Pro Leu Leu Cys Pro Asn Ser Ala Gly Ser Ser 

65 70 75 
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(2) INFORMATIONS POUR LA SEQ ID NO: 116: 
(i) CARACTERI STIQUES DE LA SEQUENCE : 

(A) LONGUEUR: 32 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS : simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 116: 
TGGGGTTCCA TTTGTAAGAC CATCTGTAGC TT 

(2) INFORMATIONS POUR LA SEQ ID NO: 117: 
(i) CARACTERI STIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 1481 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 



(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 117: 



ATGGCCCTCC 


CTTATCATAC 


TTTTCTCTTT 


ACTGTTCTCT 


TACCCCCTTT 


CGCTCTCACT 


60 


GCACCCCCTC 


CATGCTGCTG 


TACAACCAGT 


AGCTCCCCTT 


ACCAAGAGTT 


TCTATGAAGA 


120 


ACGCGGCTTC 


CTGGAAATAT 


TGATGCCCCA 


TCATATAGGA 


GTTTATCTAA 


GGGAAACTCC 


180 


ACCTTCACTG 


CCCACACCCA 


TATGCCCCGC 


AACTGCTATA 


ACTCTGCCAC 


TCTTTGCATG 


240 


CATGCAAATA 


CTCATTATTG 


GACAGGGAAA 


ATGATTAATC 


CTAGTTGTCC 


TGGAGGACTT 


300 


GGAGCCACTG 


TCTGTTGGAC 


TTACTTCACC 


CATACCAGTA 


TGTCTGATGG 


GGGTGGAATT 


360 


CAAGGTCAGG 


CAAGAGAAAA 


ACAAGTAAAG 


GAAGCAATCT 


CCCAACTGAC 


CCGGGGACAT 


420 


AGCACCCCTA 


GCCCCTACAA 


AGGACTAGTT 


CTCTCAAAAC 


TACATGAAAC 


CCTCCGTACC 


480 


CATACTCGCC 


TGGTGAGCCT 


ATTTAATACC 


ACCCTCACTC 


GGCTCCATGA 


GGTCTCAGCC 


540 


CAAAACCCTA 


CTAACTGTTG 


GATGTGCCTC 


CCCCTGCACT 


TCAGGCCATA 


CATTTCAATC 


600 


CCTGTTCCTG 


AACAATGGAA 


CAACTTCAGC 


ACAGAAATAA 


ACACCACTTC 


CGTTTTAGTA 


660 


GGACCTCTTG 


TTTCCAATCT 


GGAAATAACC 


CATACCTCAA 


ACCTCACCTG 


TGTAAAATTT 


720 


AGCAATACTA 


TAGACACAAC 


CAGCTCCCAA 


TGCATCAGGT 


GGGTAACACC 


TCCCACACGA 


780 


ATAGTCTGCC 


TACCCTCAGG 


AATATTTTTT 


GTCTGTGGTA 


CCTCAGCCTA 


TCATTGTTTG 


840 


AATGGCTCTT 


CAGAATCTAT 


GTGCTTCCTC 


TCATTCTTAG 


TGCCCCCTAT 


GACCATCTAC 


900 


ACTGAACAAG 


ATTTATACAA 


TCATGTCGTA 


CCTAAGCCCC 


A C AACAAAAG 


AGTACCCATT 


960 


CTTCCTTTTG 


JTATCAGAGC 


AGGAGTGCTA 


GG C AG ACT AG 


GT ACTGG CAT 


TGGCAGTATC 


1020 
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ACAACCTCTA 


CTCAGTTCTA 


CTACAAACTA 


TCTCAAGAAA 


TAAATGGTGA 


CATGGAACAG 


1080 






GTCACTGACT 


CCCTGGTCAC 


CTTGCAAGAT 


CAACTTAACT 


CCCTAGCAGC 


AGTAGTCCTT 


1140 






CAAAATCGAA 


GAGCTTTAGA 


CTTGCTAACC 


GCCAAAAGAG 


GGGGAACCTG 


TTTATTTTTA 


1200 


* 




GGAGAAGAAC 


GCTGTTATTA 


TGTTAATCAA 


TCCAGAATTG 


TCACTGAGAA 


AGTTAAAGAA 


1260 




5 


ATTCGAGATC 


GAATACAATG 


TAGAGCAGAG 


GAGCTTCAAA 


ACACCGAACG 


CTGGGGCCTC 


1320 






CTCAGCCAAT 


GGATGCCCTG 


GGTTCTCCCC 


TTCTTAGGAC 


CTCTAGCAGC 


TCTAATATTG 


1380 






TTACTCCTCT 


TTGGACCCTG 


TATCTTTAAC 


CTCCTTGTTA 


AGTTTGTCTC 


TTCCAGAATT 


1440 






GAAGCTGTAA 


AGCTACAGAT 


GGTCTTACAA 


ATGGAACCCC 


A 




1481 



10 (2) INFORMATIONS POUR LA SEQ ID NO: 118: 
(i) CARACTERISTIQUES DE LA SEQUENCE : 

(A) LONGUEUR: 493 acides aminees 

(B) TYPE: acide aminee 

(C) NOMBRE DE BRINS : simple 
15 (D) CONFIGURATION: lineaire 

(ii) TYPE DE MOLECULE: peptide 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 118: 

Met Ala Leu Pro Tyr His Thr Phe Leu Phe Thr Val Leu Leu Pro Pro 
1 5 10 15 

20 Phe Ala Leu Thr Ala Pro Pro Pro Cys Cys Cys Thr Thr Ser Ser Ser 

20 25 30 

Pro Tyr Gin Glu Phe Leu Xaa Arg Thr Arg Leu Pro Gly Asn lie Asp 

35 40 45 

Ala Pro Ser Tyr Arg Ser Leu Ser Lys Gly Asn Ser Thr Phe Thr Ala 
25 50 55 60 

His Thr His Met Pro Arg Asn Cys Tyr Asn Ser Ala Thr Leu Cys Met 
65 70 75 80 

His Ala Asn Thr His Tyr Trp Thr Gly Lys Met lie Asn Pro Ser Cys 
85 90 95 

30 Pro Gly Gly Leu Gly Ala Thr Val Cys Trp Thr Tyr Phe Thr His Thr 

100 105 110 

Ser Met Ser Asp Gly Gly Gly lie Gin Gly Gin Ala Arg Glu Lys Gin 

115 120 125 

Val Lys Glu Ala lie Ser Gin Leu Thr Arg Gly His Ser Thr Pro Ser 
35 130 135 140 

Pro Tyr Lys Gly Leu Val Leu Ser Lys Leu His Glu Thr Leu Arg Thr 
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145 
His 

GIu 

5 

His 
Phe 

10 Ser 
225 
Ser 

Pro 

15 

Gly 
Phe 

2 0 Leu 
305 
Leu 

lie 

25 

Glu 
Gin 

30 Ala 
385 
Gly 

Lye 

35 

Gin 




Thr Arg Leu 

Val Ser Ala 
180 

Phe Arg Pro 

195 
Ser Thr Glu 
210 

Asn Leu Glu 

Asn Thr lie 

Pro Thr Arg 
260 

Thr Ser Ala 

275 
Leu Ser Phe 
290 

Tyr Asn His 

Pro Phe Val 

Gly Ser lie 
340 

lie Asn Gly 

355 
Asp Gin Leu 
370 

Leu Asp Leu 

Glu Glu Arg 

Val Lys Glu 
420 

Asn Thr Glu 



150 
Val Ser Leu 
165 

Gin Asn Pro 

Tyr lie Ser 

lie Asn Thr 
215 

lie Thr His 

230 
Asp Thr Thr 
245 

He Val Cys 

Tyr His Cys 

Leu Val Pro 
295 

Val Val Pro 

310 

He Arg Ala 
325 

Thr Thr Ser 

Asp Met Glu 

Asn Ser Leu 
375 

Leu Thr Ala 

390 
Cys Tyr Tyr 
405 

He Arg Asp 
Arg Trp Gly 



6 

Phe Asn Thr 
170 

Thr Asn Cys 

185 
lie Pro Val 
200 

Thr Ser Val 

Thr Ser Asn 

Ser Ser Gin 
250 

Leu Pro Ser 

265 
Leu Asn Gly 
280 

Pro Met Thr 

Lys Pro His 

Gly Val Leu 
330 

Thr Gin Phe 

345 
Gin Val Thr 
360 

Ala Ala Val 

Lys Arg Gly 

Val Asn Gin 
410 

Arg He Gin 

425 
Leu Leu Ser 




155 

Thr Leu Thr 

Trp Met Cys 

Pro Glu Gin 
205 

Leu Val Gly 

220 
Leu Thr Cys 
235 

Cys He Arg 

Gly He Phe 

Ser Ser Glu 
285 

He Tyr Thr 

300 
Asn Lys Arg 
315 

Gly Arg Leu 

Tyr Tyr Lys 

Asp Ser Leu 
365 

Val Leu Gin 

380 
Gly Thr Cys 
395 

Ser Arg He 
Cys Arg Ala 
Gin Trp Met 
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160 

Arg Leu His 

175 
Leu Pro Leu 
190 

Trp Asn Asn 

Pro Leu Val 

Val Lys Phe 
240 

Trp Val Thr 

255 
Phe Val Cys 
270 

Ser Met Cys 

Glu Gin Asp 

Val Pro He 
320 

Gly Thr Gly 

335 
Leu Ser Gin 
350 

val Thr Leu 

Asn Arg Arg 

Leu Phe Leu 
400 

Val Thr Glu 

415 
Glu Glu Leu 
430 

Pro Trp Val 
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435 440 445 

Leu Pro Phe Leu Gly Pro Leu Ala Ala Leu lie Leu Leu Leu Leu Phe 

450 455 460 

Gly Pro Cys lie Phe Asn Leu Leu Val Lys Phe Val Ser Ser Arg lie 
5 465 470 475 480 

Glu Ala Val Lys Leu Gin Met Val Leu Gin Met Glu Pro 
485 490 

(2) INFORMATIONS POUR LA SEQ ID NO: 119: 
10 (i) CARACTERISTIQUES DE LA SEQUENCE : 

(A) LONGUEUR: 32 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS : simple 
<D) CONFIGURATION : lineaire 

15 (ii) TYPE DE MOLECULE: ADNc 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 119: 
TCAAAATCGA AGAGCTTTAG ACTTGCTAAC CG 32 

(2) INFORMATIONS POUR LA SEQ ID NO: 120: 

2 0 (i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 1329 paires de bases 

(B) TYPE: nucleotide 

<C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
25 (ii) TYPE DE MOLECULE: ADNc 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 120: 

TCAAAATCGA AGAGCTTTAG ACTTGCTAAC CGCCAAAAGA GGGGGAACCT GTTTATTTTT 60 

AGGGGAAGAA TGCTGTTAGT ATGTTAATCA ATCTGGAATC ATTACTGAGA AAGTTAAAGA 120 

AATTTG AG AT CGAATATAAT G TAG AG C AG A GGACCTTCAA AACACTGCAC CCTGGGGCCT 180 

3 0 CCTCAGCCAA TGGATGCCCT GGACTCTCCC CTTCTTAGGA CCTCT AG C AG CTATAATATT 240 

TTTACTCCTC TTTGGACCCT GTATCTTCAA CTTCCTTGTT AAGTTTGTCT CTTCCAGAAT 300 

TGAAGCTGTA AAGCTACAAA TAGTTCTTCA AATGGAACCC CAGATGCAGT CCATGACTAA 360 

AATCTACCGT GGACCCCTGG ACCGGCCTGC TAGACTATGC TCTGATGTTA ATGACATTGA 420 

AGTCACCCCT CCCGAGGAAA TCTCAACTGC ACAACCCCTA CTACACTCCA ATTCAGTAGG 480 

35 AAGCAGTTAG AGCAGTTGTC AGCCAACCTC CCCAACAGTA CTTGGGTTTT CCTGTTGAGA 540 

GGGTGGACTG AGAGACAGGA CTAGCTGGAT TTCCTAGGCT GACTAAGAAT CCCNAAGCCT 600 
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ANCTGGGAAG 


GTGACCGCAT 


CCATCTTTAA 


ACCAATCAGA 


GAGCTCACTA 


AAATG CTAAT 


AATCATCTAT 


TGCCTGAGAG 


CACAGCGGGA 


TTCAAGCCAG 


CAACAGCAAC 


CCCCTTTGGG 


CACTCTATTT 


CACTCTATTA 


AATCATGCAA 


CTCAAGCTGA 


GCTTTTGTTC 


GCCATCCACC 


GCTGACTTCC 


ATCCCTTTGG 


ATCCAGCAGA 


ACCCATTGCC 


ACTCCCGATC 


AGGCTAAAGG 


TGGGTTTGTC 


CTAATAGAAC 


TGAACACTGG 


CCACGGCTTC 


TAATAGAGCT 


ATAACACTCA 


TCTGTGAGGC 


CAAGAACCCC 


AGGTCAGAGA 


CCCACTGCCA 


TTTTGGTAGC 


GGCCCACCAC 


CCAGTAACA 







8 

ACATGGGGCT TGCAACTTAG CTCACACCCG 660 

CAGGCAAAAA CAGGAGGTAA .AGCAATAGCC 720 

AGGACAAGGA TTGGGATATA AACTCAGGCA 780 

TCCCCTCCCA TTGTATGGGA GCTCTGTTTT 840 

CTGCACTCTT CTGGTCCGTG TTTTTTATGG 900 

ACTGCTGTTT GCCACCGTCA CAGACCCGCT 960 

GTGTCCACTG TGCTCCTGAT CCAGCGAGGT 1020 

CTTGCCATTG TTCCTGCATG GCTAAGTGCC 1080 

TCACTGGGTT CCATGGTTCT CTTCCATGAC 1140 

CCGCATGGCC CAAGATTCCA TTCCTTGGTA 1200 

ANGTGAGGCT TGCCACCATT TGGGAAGTGG 12 60 

CATCTTGGGA GCTGTGGGAG CAAGGATCCC 1320 

1329 



15 (2) INFORMATIONS POUR LA SEQ ID NO: 121: 
(i) CARACTERI STIQUES DE LA SEQUENCE : 

(A) LONGUEUR: 162 acides amines 

(B) TYPE: acide amine 

(C) NOMBRE DE BRINS : simple 
20 (D) CONFIGURATION: lineaire 

(ii) TYPE DE MOLECULE: peptide 

(xi) DESCRIPTION DE LA SEQUENCE : SEQ ID NO: 121: 
Gin Asn Arg Arg Ala Leu Asp Leu Leu Thr Ala Lys Arg Gly Gly Thr 
15 10 15 

25 Cys Leu Phe Leu Gly Glu Glu Cys Cys Xaa Tyr Val Asn Gin Ser Gly 

20 25 30 

lie lie Thr Glu Lys Val Lys Glu lie Xaa Asp Arg lie Xaa Cys Arg 

35 40 45 

Ala Glu Asp Leu Gin Asn Thr Ala Pro Trp Gly Leu Leu Ser Gin Trp 
30 50 55 60 

Met Pro Trp Thr Leu Pro Phe Leu Gly Pro Leu Ala Ala lie lie Phe 
65 70 75 80 

Leu Leu Leu Phe Gly Pro Cys lie Phe Asn Phe Leu Val Lys Phe Val 
85 90 95 

35 Ser Ser Arg lie Glu Ala Val Lys Leu Gin He Val Leu Gin Met Glu 

100 105 110 
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Pro Gin Met Gin Ser Met Thr Lys lie Tyr Arg Gly Pro Leu Asp Arg 

115 120 125 - 

Pro Ala Arg Leu Cys Ser Asp Val Asn Asp lie Glu Val Thr Pro Pro 
130 135 140 

5 Glu Glu He Ser Thr Ala Gin Pro Leu Leu His Ser Asn Ser Val Gly 

145 150 155 160 

Ser Ser 



(2) INFORMATIONS POUR LA SEQ ID NO: 122: 
10 (i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 21 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
15 (ii) TYPE DE MOLECULE: ADNc 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 122: 
GGCATTGATA GCACCCATCA G 21 



(2) INFORMATIONS POUR LA SEQ ID NO: 12 3: 
20 (i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 21 paires de bases 

(B) TYPE: nucleotide 

<C) NOMBRE DE BRINS: simple 
(D) CONFIGURATION: lineaire 
25 (ii) TYPE DE MOLECULE: ADNc 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 123: 
CATGTCACCA GGGTGGAATA G 



(2) INFORMATIONS POUR LA SEQ ID NO: 124: 
3 0 (i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
35 (ii) TYPE DE MOLECULE: ADNc 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 124: 
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10 



5 



10 



15 



20 



25 



30 



T 



(2) INFORMATIONS POUR LA SEQ ID NO: 124: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 758 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 



(xi) DESCRIPTION DE LA SEQUENCE : SEQ ID NO: 124: 



GGCATTGATA 


GCACCCATCA 


GATGGCCAAA 


TCATTATTTA 


CTGGACCAGG 


CCTTTTCAAA 


60 


ACTATCAAGC 


AGATAGGGCC 


CGTGAAGCAT 


GCCAAAGAAA 


TAATCCCCTG 


CCTTATCGCC 


120 


ATGTTCCTTC 


AGGAGAACAA 


AGAACAGGCC 


ATTACCCAGG 


GGAAGACTGG 


CAACTAGATT 


180 


TTACCCACAT 


GGCCAAATGT 


CAGGGATTTC 


AGCATCTACT 


AGTCTGGGCA 


GATACTTTCA 


240 


CTGGTTGGGT 


GGAGTCTTCT 


CCTTGTAGGA 


CAGAAAAGAC 


CCAAGAGGTA 


ATAAAGGCAC 


300 


TAATGAAATA 


ATTCCCAGAT 


TTGGACTTCC 


CCCAGGATTA 


CAGGGTGACA 


ATGGCCCCGC 


360 


TTTCAAGGCT 


GCAGTAACCC 


AGGGAGTATC 


CCAGGTGTTA 


GGCATACAAT 


ATCACTTACA 


420 


CTGTGCCTGG 


AGGCCACAAT 


CCTCCAGAAA 


AGTCAAGAAA 


ATGAATGAAA 


CACTCAAAGA 


480 


TCTAAAAAAG 


CTAACCCAAG 


AAACCCACAT 


TGCATGACCT 


GTTCTGTTGC 


CTATAACCTT 


540 


ACTAAGAATC 


CATAACTATC 


CCCCAAAAAG 


CAGGACTTAG 


CCCATACGAG 


ATGCTATATG 


600 


GATGGCCTTT 


CCTAACCAAT 


GACCTTGTGC 


TTGACTGAGA 


AATGGCCAAC 


TTAGTTGCAG 


660 


ACATCACCTC 


CTTAGCCAAA 


TATCAACAAG 


TTCTTAAAAC 


ATCACAGGGA 


ACCTGTCCCC 


720 


GAGAGGAGGG 


AAAGGAACTA 


TTCCACCCTG 


GTGACATG 






758 



(2) INFORMATIONS POUR LA SEQ ID NO: 126: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 25 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNC 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 126: 
CGGACATCCA AAGTGATGGG AAACG 

(2) INFORMATIONS POUR LA SEQ ID NO: 127: 
- (i) CARACTERISTIQUES DE LA SEQUENCE: 
(A) LONGUEUR: 26 paires de bases 
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(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 

* (ii) TYPE DE MOLECULE: ADNc 

5 (xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 127: 

* GGACAGGAAA GTAAGACTGA GAAGGC 



(2) INFORMATIONS POUR LA SEQ ID NO: 128: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 
10 (A) LONGUEUR: 26 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

15 (xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 128: 

CCTAGAACGT ATTCTGGAGA ATTGGG 



(2) INFORMATIONS POUR LA SEQ ID NO: 129: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 
20 (A) LONGUEUR: 26 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

25 (xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 129: 

TGGCTCTCAA TGGTCAAACA TACCCG 



(2) INFORMATIONS POUR LA SEQ ID NO: 130: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 
30 (A) LONGUEUR: 1511 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

35 (xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 130: 

CCTAGAACGT ATTCTGGAGA ATTGGG ACCA ATGTGACACT CAGACGCTAA GAAAGAAACG 
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ATTTATATTC 


TTCTGCAGTA 


CCGCCTGGCC 


ACAATATCCT 


CTTCAAGGGA 


Tv tv TV ^ /"* 

GAGAAACCTG 


1 *") Ci 
1ZU 




GCTTCCTGAG 


GGAAGTATAA 


ATTATAACAT 


CATCTTACAG 


CTAGACCTCT 


TCTGTAGAAA 


l on 
loU 




GGAGGGCAAA 


TGGAGTGAAG 


TGCCATATGT 


GCAAACTTTC 


TTTTCATTAA 


TV TV TV TV 

GAGACAACTC 






ACAATTATGT 


AAAAAGTGTG 


GTTTATGCCC 


TACAGGAAGC 


CCTCAGAGTC 


CACCTCCC1 A 


JUL) 


5 


CCCCAGCGTC 


CCCTCCCCGA 


CTCCTTCCTC 


AACTAATAAG 


GACCCCCCTT 


TAACCCAAAL. 


*3 r\ 




GGTCCAAAAG 


GAGATAGACA 


AAGGGGTAAA 


CAATGAACCA 


AAGAGTGCCA 


A 1 Ail CCUU-l? 






ATTATGCCCC 


CTCCAAGCAG 


TGAGAGGAGG 


AGAATTCGGC 




TPr , n , rr"Pi Of* 


/on 
** O w 




TTTTTCTCTC 


TCAGACTTAA 


AGCAAATTAA 


ts m tv tv ^*fT1 TV 

AATAGACCTA 


y-t /-t m t\ ft J\ rprp /^rp 

GGTAAAi XC1 


V^AtsA 1 AAt^L.1^ 


O ** \J 




TGACGGCTAT 


ATTGATGTTT 


TACAAGGGTT 


AGGACAATCC 


TTTGATCTGA 


CATGGAGAbA 


OUVJ 


10 


TATAATGTTA 


CTACTAAATC 


AGACACTAAC 


CCCAAATGAG 


AGAAGTGCCG 


CT. GIAAL1GL 


a An 
oou 




AGCCCGAGAG 


TTTGGCGATC 


TTTGGTATCT 


CAGTCAGGCC 


AACAATAGGA 


fn /^l TV TV TV TV TV 

TGACAACAGA 


"7 O A 




GGAAAGAACA 


ACTCCCACAG 


GCCAGCAGGC 


AGTTCCCAGT 


GTAGACCCTC 


ATTGGGACAC 


7oU 




AGAATCAGAA 


CATGGAGATT 


GGTGCCACAA 


ACATTTGCTA 


ACTTGCGTGC 


TAGAAGGACT 






GAGGAAAACT 


AGGAAGAAGC 


CTATGAATTA 


CTCAATGATG 


m y*» t\ m tv m tv tv 

TCCACTATAA 


/— V TV /-» TV ^ TV Tv TV 

CACAGGGAAA 
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GGAAGAAAAT 


CTTACTGCTT 


TTCTGGACAG 


ACTAAGGGAG 


GCATTGAGGA 


AG CATACt I 


you 




CCTGTCACCT 


GACTCTATTG 


AAGGCCAACT 


AATCTTAAAG 


GATAAGTTTA 


rp 7\ /"« T« /"* TA T 1 f 

i L- At* 1 LAb 1 


i n*? n 

-L W <c VJ 




AGCTGCAGAC 


AT TAG AAAAA 


ACTTCAAAAG 


TCTGCCTTAG 


GCCCGGAGCA 


/-i Tv Tv /TT 1 rri TV 7\ T\ 


i non 




ACCCTATTTA 


ACTTGGCATC 


CTCAGTTTTT 


m tv m T* T\ rrt Tv Tv 

TATAATAGAG 


A 1 L-.ALjLtAvjLj A 




X X ** w 




CGGGACAAAC 


GGG AT AAAAA 


AAAAAGGGGG 


GGTCCACTAC 


TTTAGTCATG 


GCCCTCAGGC 


1200 


20 


AAGCAGACTT 


TGGAGGCTCT 


G C AAA AGG G A 


AAAGCTGGGC 


AAATCAAATG 


CCTAATAGGG 


1260 




CTGGCTTCCA 


GTGCGGTCTA 


CAAGGACACT 


TTAAAAAAGA 


TTATCCAAGT 


AGAAATAAGC 


1320 




CGCCCCCTTG 


TCCATGCCCC 


TTACGTCAAG 


GGAATCACTG 


GAAGGCCCAC 


TGCCCCAGGG 


1380 




GATGAAGATA 


CTCTGAGTCA 


GAAGCCATTA 


ACCAGATGAT 


CCAGCAGCAG 


GACTGAGGGT 


1440 




GCCCGGGGCG 


AGCGCCAGCC 


CATGCCATCA 


CCCTCACAGA 


GCCCCGGGTA 


TGTTTGACCA 


1500 


25 


TTGAGAGCCA 


A 










1511 



(2) INFORMATIONS POUR LA SEQ ID NO: 131: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 
(A) LONGUEUR: 352 acides amines 
3 0 (B) TYPE: acide amine 

* (C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
r (ii) TYPE DE MOLECULE: peptide 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 131: 
35 Leu Glu Arg lie Leu Glu Asn Trp Asp Gin Cys Asp Thr Gin Thr Leu 

15 10 15 
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Arg Lys Lys Arg Phe lie Phe Phe Cys Ser Thr Ala Trp Pro Gin Tyr 

20 25 30 

Pro Leu Gin Gly Arg Glu Thr Trp Leu Pro Glu Gly Ser lie Asn Tyr 
35 40 45 

5 Asn lie lie Leu Gin Leu Asp Leu Phe Cys Arg Lys Glu Gly Lys Trp 

50 55 60 

Ser Glu Val Pro Tyr Val Gin Thr Phe Phe Ser Leu Arg Asp Asn Ser 
65 70 75 80 

Gin Leu Cys Lys Lys Cys Gly Leu Cys Pro Thr Gly Ser Pro Gin Ser 
10 85 90 95 

Pro Pro Pro Tyr Pro Ser Val Pro Ser Pro Thr Pro Ser Ser Thr Asn 

100 105 110 

Lys Asp Pro Pro Leu Thr Gin Thr Val Gin Lys Glu lie Asp Lys Gly 
115 120 125 

15 Val Asn Asn Glu Pro Lys Ser Ala Asn lie Pro Arg Leu Cys Pro Leu 

130 135 140 

Gin Ala Val Arg Gly Gly Glu Phe Gly Pro Ala Arg Val Pro Val Pro 
145 150 155 160 

Phe Ser Leu Ser Asp Leu Lys Gin lie Lys lie Asp Leu Gly Lys Phe 
20 165 170 175 

Ser Asp Asn Pro Asp Gly Tyr lie Asp Val Leu Gin Gly Leu Gly Gin 

180 185 190 

Ser Phe Asp Leu Thr Trp Arg Asp lie Met Leu Leu Leu Asn Gin Thr 
195 200 205 

25 Leu Thr Pro Asn Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu Phe 

210 215 220 

Gly Asp Leu Trp Tyr Leu Ser Gin Ala Asn Asn Arg Met Thr Thr Glu 
225 230 235 240 

Glu Arg Thr Thr Pro Thr Gly Gin Gin Ala Val Pro Ser Val Asp Pro 
30 245 250 255 

His Trp Asp Thr Glu Ser Glu His Gly Asp Trp Cys His Lys His Leu 

260 265 270 

Leu Thr Cys Val Leu Glu Gly Leu Arg Lys Thr Arg Lys Lys Pro Met 
275 280 285 

35 Asn Tyr Ser Met Met Ser Thr lie Thr Gin Gly Lys Glu Glu Asn Leu 

290 295 300 
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Thr Ala Phe Leu Asp Arg Leu Arg 
305 310 
Leu Ser Pro Asp Ser lie Glu Gly 
325 

5 lie Thr Gin Ser Ala Ala Asp lie 

340 



Glu Ala Leu Arg Lys His Thr Ser 
315 320 
Gin Leu lie Leu Lys Asp Lys Phe 

330 335 
Arg Lys Asn Phe Lys Ser Leu Pro 
345 350 



(2) INFORMATIONS POUR LA SEQ ID NO: 132: 
<i) CARACTERISTIQUES DE LA SEQUENCE: 
10 (A) LONGUEUR: 30 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

15 (xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 132: 

TGCTGGAATT CGGGATCCTA GAACGTATTC 30 



(2) INFORMATIONS POUR LA SEQ ID NO: 133: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 
20 (A) LONGUEUR: 30 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

25 (xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 133: 

AGTTCTGCTC CGAAGCTTAG GCAGACTTTT 30 

(2) INFORMATIONS POUR LA SEQ ID NO: 13 5: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 
30 (A) LONGUEUR: 398 acides amines 

(B) TYPE: acide amine 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: peptide 

35 (Xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO:" 135: 

Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro 
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15 10 15 

Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg 

20 25 30 

lie Leu Glu Arg lie Leu Glu Asn Trp Asp Gin Cys Asp Thr Gin Thr 
5 35 40 45 

Leu Arg Lys Lys Arg Phe lie Phe Phe Cys Ser Thr Ala Trp Pro Gin 

50 55 60 

Tyr Pro Leu Gin Gly Arg Glu Thr Trp Leu Pro Glu Gly Ser lie Asn 
65 70 75 80 

10 Tyr Asn lie lie Leu Gin Leu Asp Leu Phe Cys Arg Lys Glu Gly Lys 

85 90 95 

Trp Ser Glu Val Pro Tyr Val Gin Thr Phe Phe Ser Leu Arg Asp Asn 

100 105 110 

Ser Gin Leu Cys Lys Lys Cys Gly Leu Cys Pro Thr Gly Ser Pro Gin 
15 115 120 125 

Ser Pro Pro Pro Tyr Pro Ser Val Pro Ser Pro Thr Pro Ser Ser Thr 

130 135 140 

Asn Lys Asp Pro Pro Leu Thr Gin Thr Val Gin Lys Glu lie Asp Lys 
145 150 155 160 

20 Gly Val Asn Asn Glu Pro Lys Ser Ala Asn lie Pro Arg Leu Cys Pro 

165 170 175 

Leu Gin Ala Val Arg Gly Gly Glu Phe Gly Pro Ala Arg Val Pro Val 

180 185 190 

Pro Phe Ser Leu Ser Asp Leu Lys Gin lie Lys lie Asp Leu Gly Lys 
25 195 200 205 

Phe Ser Asp Asn Pro Asp Gly Tyr lie Asp Val Leu Gin Gly Leu Gly 

210 215 220 

Gin Ser Phe Asp Leu Thr Trp Arg Asp lie Met Leu Leu Leu Asn Gin 
225 230 235 240 

30 Thr Leu Thr Pro Asn Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu 

245 250 255 

Phe Gly Asp Leu Trp Tyr Leu Ser Gin Ala Asn Asn Arg Met Thr Thr 

260 265 270 

Glu Glu Arg Thr Thr Pro Thr Gly Gin Gin Ala Val Pro Ser Val Asp 
35 275 280 285 

Pro His Trp Asp Thr Glu Ser Glu His Gly Asp Trp Cys His Lys His 



FEU1LLE DE REMPLACEMENT (REGLE 26) 



WO 99/02666 PCT/FR98/01 460 



16 

290 295 300 

Leu Leu Thr Cys Val Leu Glu Gly Leu Arg Lys Thr Arg Lys Lys Pro 
305 310 315 320 

Met Asn Tyr Ser Met Met Ser Thr lie Thr Gin Gly Lys Glu Glu Asn 
5 325 330 335 

Leu Thr Ala Phe Leu Asp Arg Leu Arg Glu Ala Leu Arg Lys His Thr 

340 345 350 

Ser Leu Ser Pro Asp Ser lie Glu Gly Gin Leu lie Leu Lys Asp Lys 
355 360 365 

10 Phe lie Thr Gin Ser Ala Ala Asp lie Arg Lys Asn Phe Lys Ser Leu 

370 375 380 

Pro Lys Leu Ala Ala Ala Leu Glu His His His His His His 
385 390 395 

15 (2) INFORMATIONS POUR LA SEQ ID NO: 13 7: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 378 acides amines 

(B) TYPE: acide amine 

(C) NOMBRE DE BRINS : simple 
20 (D) CONFIGURATION: lineaire 

(ii) TYPE DE MOLECULE: peptide 

(xi) DESCRIPTION DE LA SEQUENCE : SEQ ID NO: 137: 
Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg lie Leu Glu Arg 
15 10 15 

25 lie Leu Glu Asn Trp Asp Gin Cys Asp Thr Gin Thr Leu Arg Lys Lys 

20 25 30 

Arg Phe lie Phe Phe Cys Ser Thr Ala Trp Pro Gin Tyr Pro Leu Gin 

35 40 45 

Gly Arg Glu Thr Trp Leu Pro Glu Gly Ser lie Asn Tyr Asn lie lie 
30 50 55 60 

Leu Gin Leu Asp Leu Phe Cys Arg Lys Glu Gly Lys Trp Ser Glu Val 
65 70 75 80 

Pro Tyr Val Gin Thr Phe Phe Ser Leu Arg Asp Asn Ser Gin Leu Cys 
85 90 95 

35 Lys Lys Cys Gly Leu Cys Pro Thr Gly Ser Pro Gin Ser Pro Pro Pro 

100 105 110 
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15 



20 



25 



30 

) 

35 

(2) 



Tyr Pro Ser Val 
115 

Pro Leu Thr Gin 
130 

Glu Pro Lys Ser 
145 

Arg Gly Gly Glu 

Ser Asp Leu Lys 
180 

Pro Asp Gly Tyr 
195 

Leu Thr Trp Arg 
210 

Asn Glu Arg Ser 
225 

Trp Tyr Leu Ser 

Thr Pro Thr Gly 
260 

Thr Glu Ser Glu 
275 

Val Leu Glu Gly 
290 

Met Met Ser Thr 
305 

Leu Asp Arg Leu 

Asp Ser He Glu 
340 

Ser Ala Ala Asp 
355 

Ala Ala Leu Glu 
370 

INFORMATIONS POU 
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Pro Ser Pro Thr 
120 

Thr Val Gin Lys 
135 

Ala Asn He Pro 
150 

Phe Gly Pro Ala 
165 

Gin He Lys He 

He Asp Val Leu 
200 

Asp He Met Leu 
215 

Ala Ala Val Thr 
230 

Gin Ala Asn Asn 
245 

Gin Gin Ala Val 

His Gly Asp Trp 
280 

Leu Arg Lys Thr 
295 

He Thr Gin Gly 
310 

Arg Glu Ala Leu 
325 

Gly Gin Leu He 

He Arg Lys Asn 
360 

His His His His 
375 

R LA SEQ ID NO: 



Pro Ser Ser Thr 

Glu He Asp Lys 
140 

Arg Leu Cys Pro 
155 

Arg Val Pro Val 
170 

Asp Leu Gly Lys 
185 

Gin Gly Leu Gly 

Leu Leu Asn Gin 
220 

Ala Ala Arg Glu 
235 

Arg Met Thr Thr 
250 

Pro Ser Val Asp 
265 

Cys His Lys His 

Arg Lys Lys Pro 
300 

Lys Glu Glu Asn 
315 

Arg Lys His Thr 
330 

Leu Lys Asp Lys 
345 

Phe Lys Ser Leu 
His His 



138: 



Asn Lys Asp Pro 
125 

Gly Val Asn Asn 

Leu Gin Ala Val 
160 

Pro Phe Ser Leu 
175 

Phe Ser Asp Asn 
190 

Gin Ser Phe Asp 
205 

Thr Leu Thr Pro 

Phe Gly Asp Leu 
240 

Glu Glu Arg Thr 
255 

Pro His Trp Asp 
270 

Leu Leu Thr Cys 
285 

Met Asn Tyr Ser 

Leu Thr Ala Phe 
320 

Ser Leu Ser Pro 
335 

Phe He Thr Gin 
350 

Pro Lys Leu Ala 
365 
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ft 

10 
15 
20 
25 
30 

X 

35 



(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 25 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 138: 
CTTGGAGGGT GCATAACCAG GGAAT 2 5 

(2) INFORMATIONS POUR LA SEQ ID NO: 139: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 20 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

(Xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 139: 
TGTCCGCTGT GCTCCTGATC 20 

(2) INFORMATIONS POUR LA SEQ ID NO: 140: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 2 5 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 

(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 140: 
CTATGTCCTT TTGGACTGTT TGGGT 25 

(2) INFORMATIONS POUR LA SEQ ID NO: 141: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 764 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
(ii) TYPE DE MOLECULE: ADNc 
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5 

i 

10 
15 
20 
25 
30 

V 

35 



(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 141: 



TGTCCGL X x 








TCTCCCAATT 


GGGCTAAAGG 


60 


CTTGCCAT I \j 




X nnu X Vj 




CTAATCGAGC 


TGAACACTAG 


120 


TCACTGGGTT 


CCACGGTTCT 


CI I CCA 1 tiAl- 




TaaT&fJAfiPT 

InninVjnvjv^ x 


ATAAPAPTPA 


1 AO 

X O \J 


CTGCATGGTC 






TPPfJTCAflAP 


PAAGAACCCC 


AGGTCAGAGA 


240 


ACACAACjGL 1 




TTPr TV 7\ f"> /~» 7\ /-" , 
1 J. VjvjrVrVvj v^-/-lvj 




TTTTGGAAGC 


AGCCCGCCAC 


300 


TATCTTGGGA 


GCTCTGGGAG 


CAAGGACCCC 


AGGTAACAAT 


TTGGTGACCA 


CGAAGGGACC 


360 


TGAATCCGCA 


ACCATGAAGG 


GATCTCCAAA 


GCAATTGGAA 


ATGTTCCTCC 


CAAGGCAAAA 


420 


ATGCCCCTAA 


GATGTATTCT 


GGAGAATTGG 


GACCAATTTG 


ACCCTCAGAC 


AGTAAGAAAA 


480 


AAATGACTTA 


TATTCTTCTG 


CAGTACCGCC 


CTGGCCACGA 


TATCCTCTTC 


AAGGGGGAGA 


540 


AACCTGGCCT 


CCTGAGGGAA 


GTATAAATTA 


TAACACCATC 


TTACAGCTAG 


ACCTGTTTTG 


600 


TAGAAAAGGA 


GGCAAATGGA 


GTGAAGTGCC 


ATATTTACAA 


ACTTTCTTTT 


CATTAAAAGA 


660 


CAACTCGCAA 


TTATGTTAAC 


AGTGTGATTT 


GTGTTCCTAC 


ACGGAAGCCC 


TCAGATTCTA 


720 


CTCCCCACCC 


CCGGCATCTC 


CCCTGAATCC 


CTCCCCAACT 


TATT 




764 



(2) INFORMATIONS POUR LA SEQ ID NO: 142: 
(i) CARACTERISTIQUES DE LA SEQUENCE: 

(A) LONGUEUR: 800 paires de bases 

(B) TYPE: nucleotide 

(C) NOMBRE DE BRINS: simple 

(D) CONFIGURATION: lineaire 
<ii) TYPE DE MOLECULE: ADNc 



(xi) DESCRIPTION DE LA SEQUENCE: SEQ ID NO: 142: 



TGTCCGCTGT 


GCTCCTGATC 


CAGCACAGGC 


GCCCATTGCC 


TCTCCCAATT 


GGGCTAAAGG 


60 


CTTGCCATTG 


TTCCTGCACA 


GCTAAGTGCC 


TGGGTTCATC 


CTAATCGAGC 


TGAACACTAG 


120 


TCACTGGGTT 


CCACGGTTCT 


CTTCCATGAC 


CCATGGCTTC 


TAATAGAGCT 


ATAACACTCA 


180 


CTGCATGGTC 


CAAGATTCCA 


TTCCTTGGAA 


TCCGTGAGAC 


CAAGAACCCC 


AGGTCAGAGA 


240 


ACACAAGGCT 


TGCCACCATG 


TTGGAAGCAG 


CCCACCACCA 


TTTTGGAAGC 


GGCCCGCCAC 


300 


TATCTTGGGA 


GCTCTGGGAG 


CAAGGACCCC 


CAGGTAACAA 


TTTGGTGACC 


ACGAAGGGAC 


360 


CTGAATCCGC 


AACCATGAAG 


GGATCTCCAA 


AGCAATTGGA 


AATGTTCCTC 


CCAAGGCAAA 


420 


AATGCCCCTA 


AGATGTATTC 


TGGAGAATTG 


GGACCAATCT 


GACCCTCAGA 


CAGTAAGAAA 


480 


AAAAATGACT 


TATATTCTTC 


TGCAGTACCG 


CCTGGCCACG 


GATATCCTCT 


TCAAGGGGGA 


540 


GAAACCTGGC 


CTCCTGAGGG 


AAGTATAAAT 


TATAACACCA 


TCTTACAGCT 


AGACCTGTTT 


600 


TGTAGAAAAG 


GAGGCAAATG 


GAGTGAAGTG 


CCATATTTAC 


AAACTTTCTT 


TTCATTAAAA 


660 


GACAACTCGC 


AATTATGTAA 


ACAGTGTGAT 


TTGTGTCCTA 


CAGGAAGCCC 


TCAGATCTAC 


720 


CTCCCTACCC 


CGGCATCTCC 


CTGACTCCTT 


CCCCAACTAA 


TAAGGACCCA 


CTTCAGCCCA 


780 
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800 
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